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METHOD AND SYSTEM FOR SHAPING TRANSMITTED POWER SPECTRAL 
DENSITY ACCORDING TO LINE CONDITIONS 

CROSS REFERENCE TO RELATED APPLICATION 

This application claims priority from provisional application Serial No. 
60/308,587, filed July 31, 2001, entitled Method and System for Duplex Symmetric 
Transmission, which is incorporated by reference. 

FIELD OF THE INVENTION 

The present invention relates generally to communication systems providing 
duplex symmetric transmission, more particularly, to optimizing performance of non- 
DMT-based DSL by shaping transmitted power spectral density according to line 
conditions. 

BACKGROUND OF THE INVENTION 

Traditionally, a modem converts data between the analog form, used for 
communicating over telephone lines, and the digital form, used on computers for the 
purposes of computer processing and computer-to-computer communication. Generally, 
standard modems may transmit data at a maximum rate of 56,000 bits per second (bps) or 
56kbps. However, inherent limitations of phone systems may translate to lower modem 
speeds and other limitations. Modems at a transmitting end modulate the digital data of 
computers into analog signals to send over telephone lines, such as Plain Old Telephone 
System (POTS). Then, modems at a destination receiving site demodulate the analog 
signals back into digital signals to be read by a destination computer on the other end. 
There are standards to ensure that modems made by different manufacturers can 
communicate with each other. For example, modems communicating with each other 
may be required to use the same speed and comply with other requirements. 

More recently, modems for cable and Digital Subscriber Line (DSL) service have 
come to be known as digital modems while those used for traditional dial-up networking 
are referred to as analog modems. DSL technology provides high-speed, broadband 
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network connections to homes, businesses and other users. DSL utilizes the same cabling 
used for normal telephones, while offering higher data rates and other advantages through 
the use of digital modem technology. 

G.SHDSL is a standard that enables manufacturers and other entities to develop 
5 Central Office (CO) loop access equipment and Customer Premises Equipment (CPE) 
around a single standard, thereby increasing market share and decreasing component 
costs. As a symmetric multi-rate DSL, G.SHDSL can operate over a single pair of 
copper wires. For speed versatility, the technology can also be deployed over dual 
copper pairs. G.SHDSL has associated with it a global standard developed by the 
10 International Telecommunications Union's (ITU) Telecommunications Standards Sector 
based in Geneva. Higher bit rate and longer copper transmission line (or loop) are 
additional factors that strengthen G.SHDSL. 

G.hs (handshake) protocol negotiates the highest achievable data rate given the 
loop conditions. Using the G.hs protocol during pre-activation, service type may be 
15 negotiated during start-up (e.g., training). With this protocol, the most efficient framing 
type may be negotiated to avoid unnecessary overhead and latency on the DSL link. 

Optimization of various factors, such as power back off, determination of data 
rates, filter lengths, transmit power spectral density, and other factors, may affect modem 
and system performance. 
20 Traditional methods for determining power back off (PBO) are generally 

computed in the time domain. Similarly, current methods for determining an optimal 
data rate involve calculating SNR in the time domain and providing an estimate of the 
capacity. Traditional methods for dynamically adapting the length of a filter result in 
higher costs and power consumption. Estimation of a base-2 logarithm of a number 
25 generally involves a separate polynomial for each range of input numbers or an 
expansion requiring a large number of terms, which are oftentimes complicated and 
difficult to implement. 

Therefore, there is a need in the art of modem systems for a more efficient method 
and system for providing efficient communication between modems. 

30 
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SUMMARY OF THE INVENTION 

Aspects of the present invention overcome the problems noted above, and realize 
additional advantages. One such inventive aspect provides methods and systems for 
improving the performance of a pair of non-DMT-based DSL modems by choosing 
5 transmitted PSD's according to the line conditions. In addition, the signal-to-noise ratio 
(SNR) of both modems may be above some minimum SNR needed for the target bit error 
rate (BER). Further, the margin levels specified in the respective standard are satisfied. 

According to an aspect of the present invention, a method for shaping transmitted 
power spectral density based on line condition data comprises the steps of determining 
10 first line condition data associated with a first modem; determining second line condition 
data associated with a second modem; exchanging, respectively, the first and second line 
M condition data between the first modem and the second modem; and shaping a transmit 

ri spectrum for each of the first modem and the second modem in response thereto. 

Other features of the present invention include wherein the transmit spectrum of 
In 15 the first modem and the transmit spectrum of the second modem are shaped differently; 
JJ1 wherein the transmit spectrum of the first modem and the transmit spectrum of the second 

■ modem are shaped substantially the same; further comprising the step of identifying one 

j I or more interfering signals that interfere with data received by each of the first modem 

J I and the second modem where the interfering signals comprises one or more of noise and 

O 20 echo; wherein the steps are performed prior to a step of modem training; wherein the 

if i! f. 

steps of determining line condition data further comprise the steps of determining a 
plurality of signal power values associated with at least one of the first modem and the 
second modem, determining a plurality of noise power values associated with at least one 
of the local modem and the remote modem, and determining a plurality of signal to noise 

25 ratio values associated with at least one of the local modem and the remote modem; 
wherein line condition data comprises noise data and loop data; wherein noise data 
occupy a range of values between high noise and low noise and wherein loop data occupy 
a range of values between short loop and long loop; further comprising the step of 
reducing transmitted power for a predetermined plurality of frequencies for reducing 

30 uncancelled echo power at a first modem wherein the first modem is a local modem; 
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wherein a maximum loop length is extended for which a signal to noise ratio is 
maintained above a minimum signal to noise ratio; wherein a noise margin is increased 
on a loop for which the uncancelled echo power reduces the noise margin, wherein the 
noise margin comprises an amount of noise that is added after at least one of a first 

5 modem and a second modem achieves steady-state while maintaining a signal to noise 
ratio above a minimum signal to noise ratio; wherein the uncancelled echo power is 
reduced because of a reduction in one or more effects of a nonlinear echo channel 
producing noise at frequencies other than a transmitted frequency; wherein the 
uncancelled echo power is reduced because of a reduction in a required number of echo 

10 canceller filter taps; further comprising the step of increasing transmitted power for a 
predetermined plurality of frequencies for increasing a signal received at the second 
modem, the second modem being a remote modem, wherein the received signal at the 
remote modem is determined by the signal transmitted by the first modem, the first 
modem being a local modem, and loop attenuation; further comprising the step of 

15 minimizing a number of transmit filters to be stored by using at least one transmit filter 
with a plurality of normalized parameters that are normalized to a data rate; wherein the 
normalized parameters comprise a cutoff frequency of the transmit filter; further 
comprising the steps of implementing a first power spectral density for optimizing noise 
performance and implementing a second power spectral density for optimizing reach 

20 performance; further comprising the step of interpolating between the first power spectral 
density and the second power spectral density; wherein the first line condition data and 
the second line condition data are from a common line; wherein the first line condition 
data and the second line condition data are from different lines; wherein one or more of 
the first line condition data and the second line condition data are derived from cross talk; 

25 wherein one or more of the first line condition data and the second line condition data 
comprise a frequency-independent reduction in power determined by calculating a signal 
to noise ratio using a geometric mean for a sub-band of a total frequency band for a 
plurality of rates further comprising the step of choosing a transmitted power for a 
predetermined plurality of frequencies to simultaneously balance reducing the 

30 uncancelled echo power at the local modem while increasing a received signal at the 
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second modem, the second modem being a remote modem; and further comprising the 
step of choosing a transmitted power for a predetermined plurality of frequencies to 
simultaneously balance reducing uncancelled echo power at the first modem, the first 
modem being a local modem, while increasing a received signal at the second modem, 

5 the second modem being a remote modem. 

According to another aspect of the present invention, a system for shaping 
transmitted power spectral density based on line condition data comprises a first 
determining module for determining first line condition data associated with a first 
modem; a second determining module for determining second line condition data 

10 associated with a second modem; an exchanging module for exchanging, respectively, 
the first and second line condition data between the first modem and the second modem; 
and a shaping module for shaping a transmit spectrum for each of the first modem and the 
second modem in response thereto. 

Other features of the present invention include wherein the transmit spectrum of 

15 the first modem and the transmit spectrum of the second modem are shaped differently; 
wherein the transmit spectrum of the first modem and the transmit spectrum of the second 
modem are shaped substantially the same; an identifying module for identifying one or 
more interfering signals that interfere with data received by each of the first modem and 
the second modem where the interfering signals comprises one or more of noise and 

20 echo; wherein the system is operated prior to modem training; a signal power 
determining module for determining a plurality of signal power values associated with at 
least one of the first modem and the second modem, a noise power determining module 
for determining a plurality of noise power values associated with at least one of the local 
modem and the remote modem, and a signal to noise ratio determining module for 

25 determining a plurality of signal to noise ratio values associated with at least one of the 
local modem and the remote modem; wherein line condition data comprises noise data 
and loop data; wherein noise data occupy a range of values between high noise and low 
noise and wherein loop data occupy a range of values between short loop and long loop; a 
reducing power module for reducing transmitted power for a predetermined plurality of 

30 frequencies for reducing uncancelled echo power at a first modem wherein the first 
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modem is a local modem; wherein a maximum loop length is extended for which a signal 
to noise ratio is maintained above a minimum signal to noise ratio; wherein a noise 
margin is increased on a loop for which the uncancelled echo power reduces the noise 
margin, wherein the noise margin comprises an amount of noise that is added after at 

5 least one of a first modem and a second modem achieves steady-state while maintaining a 
signal to noise ratio above a minimum signal to noise ratio; wherein the uncancelled echo 
power is reduced because of a reduction in one or more effects of a nonlinear echo 
channel producing noise at frequencies other than a transmitted frequency; wherein the 
uncancelled echo power is reduced because of a reduction in a required number of echo 

10 canceller filter taps; an increasing power module for increasing transmitted power for a 
predetermined plurality of frequencies for increasing a signal received at the second 
modem, the second modem being a remote modem, wherein the received signal at the 
remote modem is determined by the signal transmitted by the first modem, the first 
modem being a local modem, and loop attenuation; a minimizing module for minimizing 

15 a number of transmit filters to be stored by using at least one transmit filter with a 
plurality of normalized parameters that are normalized to a data rate; wherein the 
normalized parameters comprise a cutoff frequency of the transmit filter; wherein a first 
power spectral density is implemented for optimizing noise performance; and a second 
power spectral density is implemented for optimizing reach performance; an interpolating 

20 module for interpolating between the first power spectral density and the second power 
spectral density; wherein the first line condition data and the second line condition data 
are from a common line; wherein the first line condition data and the second line 
condition data are from different lines; wherein one or more of the first line condition 
data and the second line condition data are derived from cross talk; wherein one or more 

25 of the first line condition data and the second line condition data comprise a frequency- 
independent reduction in power determined by calculating a signal to noise ratio using a 
geometric mean for a sub-band of a total frequency band for a plurality of rates; a 
selecting module for choosing a transmitted power for a predetermined plurality of 
frequencies to simultaneously balance reducing the uncancelled echo power at the local 

30 modem while increasing a received signal at the second modem, the second modem being 
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a remote modem; and a selecting module for choosing a transmitted power for a 
predetermined plurality of frequencies to simultaneously balance reducing uncancelled 
echo power at the first modem, the first modem being a local modem, while increasing a 
received signal at the second modem, the second modem being a remote modem. 
5 The accompanying drawings, which are incorporated in and constitute a part of 

this specification, illustrate various embodiments of the invention and, together with the 
description, serve to explain the principles of the invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 

10 The present invention can be understood more completely by reading the 

following Detailed Description of the Invention, in conjunction with the accompanying 
drawings, in which: 

FIG. la is a block diagram illustrating an example of a line probe session, 
according to an embodiment of a first aspect of the present invention. 

15 FIG. lb is a flowchart illustrating a line probe session for STUR, according to an 

embodiment of a first aspect of the present invention. 

FIG. lc is a flowchart illustrating a line probe session for STUC, according to an 
embodiment of a first aspect of the present invention. 

FIG. Id is timing diagram, according to an embodiment of a first aspect of the 
20 present invention. 

FIG. 2 is a flowchart illustrating a line probe session with a sub-band SNR 
calculation, according to an embodiment of the first aspect of the present invention. 

FIG. 3a is a chart illustrating a noise scenario, according to an embodiment of the 
first aspect of the present invention. 

25 FIG. 3b is a chart illustrating a noise scenario, according to an embodiment of the 

first aspect of the present invention. 

FIG. 4a is a chart illustrating a sub-band SNR, according to an embodiment of the 
first aspect of the present invention. 
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FIG. 4b is a chart illustrating a sub-band SNR, according to an embodiment of the 
first aspect of the present invention. 

FIG. 5 is a flowchart illustrating a line probe session with a sub-band capacity 
calculation, according to an embodiment of a second aspect of the present invention. 

FIG. 6 is a plot representing normalized SNR vs. noise power and channel 
attenuation, according to an embodiment of a fifth aspect of the present invention. 

FIG. 7 is a block diagram of a G.SHDSL system, according to an embodiment of 
the fifth aspect of the present invention. 

FIGs. 8-11 are charts illustrating echo channel frequency responses, according to 
an embodiment of the fifth aspect of the present invention. 

FIG. 12 is a chart illustrating transmit PSD that reduces nonlinear effects at lower 
frequencies, according to an embodiment of the fifth aspect of the present invention. 

FIG. 13 is a flowchart illustrating a filter selection method, according to an 
embodiment of the fifth aspect of the present invention. 

FIG. 14 is a system diagram of a communication network, according to an 
embodiment of the present invention. 

FIG. 15 is a chart illustrating echo canceller filter taps, according to an 
embodiment of a third aspect of the present invention. 

FIG. 16 is a chart illustrating echo canceller filter taps, according to an 
embodiment of the third aspect of the present invention. 

FIG. 17 is a chart illustrating a polynomial curve used to approximate echo 
channel length, according to an embodiment of the third aspect of the present invention. 

FIG. 18 is a chart illustrating a linear curve used to approximate echo channel 
length, according to an embodiment of the third aspect of the present invention. 

FIG. 19 is a block diagram illustrating basic operations of an echo canceller, 
according to an embodiment of the third aspect of the present invention. 
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FIG. 20 is a flowchart illustrating a software algorithm, according to an 
embodiment of the third aspect of the present invention 

FIGs. 21 and 22 are charts illustrating a fractional part of a log, according to an 
embodiment of a fourth aspect of the present invention. 

5 FIGs. 23-26 are charts illustrating errors of different approximations in terms of 

dB, according to an embodiment of the fourth aspect of the present invention. 

FIG. 27 is a block diagram of a digital circuit to generate an integer part and 
estimate of a fractional part of Log-2, according to an embodiment of the fourth aspect of 
the present invention. 

10 FIG. 28 is a block diagram of a digital circuit to implement a second order 

polynomial, according to an embodiment of the fourth aspect of the present invention. 

FIG. 29 is a schematic diagram of a hardware architecture in which the inventive 
aspects of the present invention may be incorporated. 

FIG. 30 is a block diagram illustrating details of SNR margin, according to an 
15 embodiment of the present invention. 

FIG. 31 illustrates a flowchart describing hardware algorithm for SNR margin, 
according to an embodiment of the present invention. 

FIG. 32 is a schematic diagram of a hardware architecture in which the inventive 
aspects of the present invention may be incorporated. 

20 LIST OF ACRONYMS 

• ADC - Analog Digital Converter 

• AFE - Analog Front End 

• AGC - Automatic Gain Control 

25 • ANSI - American National Standard Institute 

• AR - autoregressive 

• ARMA - autoregressive moving average 

• ATM - Asynchronous Transfer Mode 

• AWG - American Wire Gauge 

30 • AWGN - additive white and gaussian 

• bps - bits per second 
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• BER - bit error rate 

• CO- Central Office 

• CPE - Customer Premises Equipment 

• CPU - central processing unit 
5 • CRL - clock recovery loop 

• DAC - Digital Analog Converter 

• DAGC - Digital automatic gain control 

• DAV - Digital Audio Video 

• DFT - discrete Fourier transform 
10 • DMT - Discrete Multi-Tone 

• DSL - Digital Subscriber Line 

• DSLAMs - DSL Access Multiplexers 

• EC - echo canceller 

• EPL - estimated power loss 
15 • EQ- Equalizer 

• ETSI - European Telecommunications Standard Institute 

• FF - Feed Forward 

• FIR - Finite Impulse Response 

• FFT - Fast Fourier Transform 
20 • G.hs - handshake 

• HDSL - High Speed Digital Subscriber Line 

• IAD - Integrated Access Devices 

• ITU - International Telecommunications Union 

• IV-REF - Current and Voltage Reference Generator 
25 • LA - loop attenuation 

• LP - line probe 

• LSB - least significant bit 

• MA - moving average 

• MIPS - Million Instructions Per Second 
30 • MMSE - Minimum Mean Squared Error 

• MSB - most significant bit 

• MSE - mean squared error 

• NEXT - near end cross talk 

• NEXT PSD - near end cross talk power spectral density 
35 • PAC - programmable attenuation control 

• PAR - peak-to-average ratio 

• PBO - power back off 

• PGA - programmable gain amplifier 

• PHY - Physical Layer Device 
40 • PLL - Phase Locked Loop 

• POTS - Plain Old Telephone System 

• PSD - power spectral density 
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• QMF - quadrature mirror filter 

• RISC - Reduced Instruction Set Computing 

• ROI - rate of interest 

• Rx - Receive 

5 • S-D - Sigma-Delta 

• SDSL - Synchronous Digital Subscriber Line 

• SNMP - Simple Network Management Protocol 

• SNR - signal to noise ratio 

• SOHO - small office/home office 
10 • STM - Synchronous Transfer Mode 

• STUC - SHDSL Transceiver Unit - Central Office 

• STUR - SHDSL Transceiver Unit - Remote End 

• TPS-TC - Transmission Protocol Specific - Transmission Convergence 

• Tx - Transmit 

15 • USB - Universal Serial Bus 

• VCXO DAC - Volt Controlled Oscillator Digital Analog Converter 

DETAILED DESCRIPTION OF THE INVENTION 

20 The following description is intended to convey a thorough understanding of the 

invention by providing a number of specific embodiments and details involving modems 
applications. It is understood, however, that the invention is not limited to these specific 
embodiments and details, which are exemplary only. It is further understood that one 
possessing ordinary skill in the art, in light of known systems and methods, would 

25 appreciate the use of the invention for its intended purposes and benefits in any number 
of alternative embodiments, depending upon specific design and other needs. 

An embodiment of the present invention is directed to determining maximum 
power backoff for a G.SHDSL modem using frequency domain geometric signal to noise 
ratio (SNR). A G.SHDSL standard may specify a minimum power back off (PBO) for 

30 modem implementation. Power back-off may be specified as an amount of PBO in dB 
for an estimated line loss. Although the standard specifies a minimum back off, it is 
desirable to be able to increase the PBO beyond the minimum. The reasons for this may 
include reduced power consumption and reduced crosstalk generated by a modem. An 
aspect of the present invention discloses a method and system for determining an 

35 absolute maximum power PBO that may be tolerated and still meet bit error rate (BER) 
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and/or other requirements. According to another embodiment, the present invention 
implements a geometric mean to compute SNR in a frequency domain over a pass-band 
of a transmit spectrum. 

During a line probe session, a G.SHDSL modem may determine, among other 

5 things, the level of PBO that the modem may support given the line conditions. FIG. la 
illustrates a line probe (LP) session for G.HS (e.g., 114, 116) between SHDSL 
Transceiver Unit-Central Office (STUR) 110 and SHDSL Transceiver Unit-Remote End 
(STUC) 1 12, which may last a maximum of approximately 10 seconds, for example. 

During the transmission of each probe signal, represented by Prx (e.g., Prl, Pr2, 

10 Pr3) and Pcx (e.g., Pel, Pc2) in FIG. la, the modem transmitting the probe may train an 
associated automatic gain control (AGC) and echo canceller (EC). It may then measure a 
residual echo signal and use this signal as an estimate of the noise spectrum. In addition, 
the modem may measure a silence power spectrum during segments where no signals are 
transmitted. This measurement may be used for a noise spectrum estimate. The modem 

15 receiving the probe signal may measure its received signal and noise spectrum. One 
modem may transmit a probe signal at any given time. 

When the modems are not measuring received signals, the modems may calculate 
a sub-band SNR for each rate using the methods described below and assign a PBO value 
for the given rate. 

20 FIG. lb is a flowchart illustrating a line probe session for STUR and FIG. lc is a 

flowchart illustrating a line probe session for STUC. 

In FIG. lb, a line probe session for STUR may be initiated at start 120. At step 
122, a first pre-activation handshaking may be performed. At step 124, a line probe may 
be initiated. If the line probe is initiated, silence power may be measured, at step 126. 

25 Probe signals may be sent to STUC, at step 128. Probe signals may be received from 
STUC, at step 130. Capacity, Power back off (PBO) sub-band and signal-to-noise (SNR) 
may be determined, at step 132. Other calculations and/or factors may be determined as 
well. A second pre-activation handshaking may be performed at step 134. Cr (which 
represents a remote unit training signal) Automatic Gain Control (AGC)/echo canceller 

30 (EC) training may be transmitted, at step 136. Clock recovery loop (CRL) training may 
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be initiated at step 138. Sc (which represents a central office unit training signal) may be 
detected and CRL training may continue, at step 140. Digital automatic gain control 
(DAGC) training may be performed, at step 142. Equalizer (EQ) training may be 
performed, at step 144. Tc (which represents a central office unit training signal) may be 

5 detected, at step 146. Tr (which represents a remote unit training signal) may be 
transmitted, at step 148. Fc (which represents a central office unit training signal) may be 
detected, at step 150. At step 152, steady state may be achieved. 

In FIG. lc, a line probe session for STUC may be initiated, at start 160. At step 
162, a first pre-activation handshaking may be performed. At step 164, a line probe may 

10 be initiated. If the line probe is initiated, silence power may be measured, at step 166. 
Probe signals may be received from STUR, at step 168. Probe signals may be sent to 
STUR, at step 170. Capacity, PBO, and SNR may be determined, at step 172. Other 
calculations and/or factors may be determined as well. A second pre-activation 
handshaking may be performed, at step 174. Cr may be detected, at step 176. Sc 

15 AGC/EC training may be transmitted, at step 178. Sc may be detected, at step 180. 
DAGC training may be performed, at step 182. Equalizer (EQ) training may be 
performed, at step 184. Tc may be transmitted, at step 186. Tr may be detected, at step 
188. Fc may be transmitted, at step 190. At step 192, steady state may be achieved. 

FIG. Id is an example of a timing diagram of an activation sequence. As 

20 illustrated, STUR may initiate Cr, lasting a duration of t cr , which has a nominal value of 1 
second with ! 20 millisecond tolerance. Time from the end of Cr to a beginning of Sc is 
represented by t^sc, which has a nominal value of 500 millisecond with ! 20 millisecond 
tolerance. After a time t crsc , STUC may initiate Sc. Time from the end of Cr to a 
beginning of Sr is represented by t crsr , which has a nominal value of 1.5 second with ! 20 

25 millisecond tolerance. After a time t C r S r> STUR may initiate Sr. After Sc, STUC may 
initiate Tc. After Sr, STUR may initiate Tr. After Tc, STUC may initiate Fc. At 
approximately the same time, Datac and Data r may be initiated by STUC and STUR, 
respectively. Time from the beginning of Cr to the beginning of Data r is represented by 
tActdata* which has a nominal value of 15 seconds. 
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If the SNR is calculated in the time domain, one method to determine PBO is 
according to the equations shown below. 



SNR dB = lOlogio 



signal + noise 

P 

noise 



lOlog 



10 



( M-1 A 

5>(«)+w(«)]l 2 

n=0 



PBO dB = SNR d B - (so + ca + SNR min ) 
(2) 



M-1 

n=0 



(i) 



s(n) = sample of the received signal 

w(n) = n th sample of the received noise 

M = window length in samples used to compute average 

P si gn ai + noise = power of signal + noise 

P n0lse = power of noise only 

where so represents a required margin in dB (> 0 dB, example: G.SHDSL Annex 
10 B margin is 6 dB); SNR m in represents a minimum SNR in dB needed to obtain the 
specified BER, and o& represents an implementation loss in dB. 

A problem with this estimate is not being localized in frequency. Finding the 
noise and signal powers as shown in (1) and (2) above may be the equivalent of 
integrating under the entire frequency domain PSD. For a sub-band of the total 
15 frequency band, this estimate may not produce the desired results. 

According to an embodiment of the present invention, PBO selection may be 
based on a sub-band SNR calculation. FIG. 2 illustrates a flowchart for a line probe 
session with a sub-band SNR calculation, according to an embodiment of the present 
invention. At step 210, a line probe session may be initiated. At step 212, a process for 
20 preparing to probe R rates may be initiated. At step 214, it may be determined whether i 
= R-l, where i represents an index for the rate. If so, PBO may be initiated for a chosen 
symbol rate, at step 216. The line probe session may be terminated at step 218. If i does 
not equal R-l, rate i may be configured, at step 220. At step 222, a noise signal may be 
sampled. At step 224, a discrete Fourier transform (DFT) may be computed for the noise 
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signal. At step 226, noise power spectral density (PSD) may be estimated for the noise 
signal. At step 228, relevant information may be stored. At step 230, a transmit signal 
may be sampled and detected. At step 232, a DFT may be computed for the transmit 
signal. At step 234, a signal and noise PSD may be estimated. At step 236, a SNR of 

5 frequency sub-bands may be computed. At step 238, sub-bands satisfying a condition 
(e.g., SNR > 1) may be summed. At step 240, relevant information may be stored. 
Following step 240, the process may return to step 214 to again determine whether i = R- 
1, where the variable i has been increased by a predetermined value (as shown by i++). 
For verification of a symbol rate with the new PBO value, the line probe can be restarted 

10 with a new PBO. 

According to an embodiment of the present invention, to compute the SNR based 
on frequency domain data, power spectrums of the signal and noise may be determined, 
as shown in the equations below. 



z K j k n l = S(jk) + W(Jk) (3) 



15 F(/:)=^-£[s(n) + w(n)]exp 



V N J 



W(k) = — > w(n)exp 

N ~ w F l N 



(4) 



s(n) = time sample of the received signal 

w{n)-n th time sample of the received noise 

N = window length in time samples used to compute spectrum 

S(k)= k th frequency suband of the received signal spectrum 

W(k) = k th frequency suband of the received noise spectrum 

Y(k)= k th frequency suband of signal plus noise spectrum 

j = J-L 

20 where S(k) represents a DFT of the signal; W(k) represents a DFT of noise; Y(k) 
represents a DFT of signal-plus-noise; s(n) represents a transmit signal; w(n) represents a 
noise signal. 
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10 



According to an embodiment of the present invention, these transforms may be 
computed with Fast Fourier Transform (FFT). A real 2N input sequence may be packed 
into an N point complex sequence as shown below. According to an embodiment of the 
present invention, transform weights may be computed on the fly using the method 
shown below. Frequency cells may be estimated using a number of methods other than a 
conventional DFT, such as autoregressive (AR), moving average (MA), autoregressive 
moving average (ARMA), quadrature mirror filter (QMF) filter bank, and other methods. 

To illustrate a geometric SNR, a geometric mean may be defined as follows: 



mtMgeametric 



n 



a, 



(5) 



a, = i' h sample used for average 

M = window size in samples used for average 

Using the geometric mean, a SNR of the channel may be computed using the 
following: 



SNR = 



] Y(k)-W(k)\ 2 



n 



\W(k) f 



(6) 



15 



SNR = 101og 



10 



n 

k=a 



\Y(k)-W(k)\ 
\W(k) | 2 



i-a+l 
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|s(fc)l 2 

\W(k)\ 2 



(7) 



which may be rewritten in the following manner to filter cells with negative or zero SNR 



D' k =log 
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\jmt 

\W(k)\ 
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D\>0 
otherwise 



(8) 



(9) 



16 



PATENT 

Attorney Docket No.: 56162.000353 



SNRhb = 



10 m \ 



p-a+l 



(10) 



where 0 < cs < so < N-l; S(k) represents an estimate of k th frequency sub-band of a 

5 received signal spectrum; W(k) represents an estimate of k* frequency sub-band of a 
received noise spectrum; Y(k) represents a k th frequency sub-band of signal plus noise 
spectrum; e& represents a starting sub-band; BO represents an ending sub-band; 
represents one or more sub-bands with SNR greater than zero; D\ represents SNR for k th 
sub-band. 

10 The following example illustrates a sub-band SNR calculation where the sub-band 

approach may be used to optimally shape a spectrum for maximum power back off. The 
FIGs. 3a, 3b, 4a and 4b illustrate two data rates, 768,000 bps and 384,000 bps. FIGs. 3a 
and 3b show the transmitted and received spectrums as well as the near-end crosstalk 
power spectral density (NEXT PSD), which is essentially the received noise spectrum. 

15 FIGs. 3a and 3b illustrate a noise scenario including 30 NEXT disturbers of 

ADSL downstream with a loop length of 5000 feet of 26 AWG twisted-pair, which is the 
noise spectrum generated by 30 twisted pair ADSL modems in the same wire bundle as 
the modem being simulated, according to an embodiment of the present invention. As 
can be seen by these plots, most of the noise energy is concentrated outside of the 

20 transmission band of the modem. More specifically, plot 310 represents 768k TX PSD, 
plot 312 represents 768k RX PSD, plot 314 represents 384k TX PSD, plot 316 represents 
384 RX PSD, and plot 318 represents NEXT PSD wherein RX plots 312 and 316 may be 
attenuated by the channel. When the SNR is computed in the time domain, some or all of 
out of band noise may be averaged in a SNR estimate. The SNR estimate may be 

25 considered conservative as much of the out of band noise may be removed with a receive 
filter and equalizer, for example. If the power cutback is based on this time domain SNR, 
it may also be considered conservative. Thus, using the sub-band SNR, in accordance 
with the present invention, provides the ability to back off the power more aggressively. 
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FIGs. 4a and 4b illustrate a sub-band SNR, according to an embodiment of the 
present invention. As illustrated, frequency domain SNR for certain frequencies may be 
significantly higher than time domain numbers included on the plot. Plot 410 represents 
768k sub-band SNR and plot 412 represents 384k sub-band SNR, as shown in FIGs. 4a 
5 and 4b. For example, if the required SNR to achieve the minimum BER is 24 dB, the 
time domain SNR for 768k may suggest a maximum of approximately 2 dB PBO. 
However, looking at the frequency domain SNR computed with the geometric mean 
across a transmission band, with an average SNR of 87.48 dB, the PBO may be 
significantly more. For example, 26.21 (SNR time) minus 24 (min BER) equals 2.21 dB 

10 while 87.48 minus 24.00 equals 63.48 dB. 

One way to achieve a maximum (or optimal) PBO involves shaping the spectrum 
such that frequencies with the highest SNR are attenuated the most and frequencies 
where the SNR is close to the threshold are not cutback or minimally cutback. For 
example, as shown in FIGs. 4a and 4b, the SNR may increase as frequency decreases. 

15 The filter's spectral shape may gradually increase the transmitted power with increasing 
frequency, maintaining an approximately constant SNR throughout the passband. 

The following equations illustrate a method for packing a 2N-point real input 
signal into an N-point complex input for efficient computation of the FFT. 
y(n) = s(2n) +js(2n + 1) = h(n) +jg(n) (1 1) 

20 where n = 0, 1 N-l 

-2-;r- j-k-n^ 



™ n=0 



N 



= R(k)+jI(k) 



S R (k) = - [R(k) + R(N -k)] + - cos 
2 2 



j 

r 7tk^ 

[n) 



1 



[I(k) + I(N-k)]-^sm^ 



(12) 

[R(k)-R(N-k)] 
(13) 



25 Stfc) = j [/(£) - I(N -k)]+- sin' ^ 
2 2 



N 



[I(k) + I(N-k)]-Uos^ 



[R(k)-R(N-k)] 



(14) 
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Equations (11) to (14) are described in "The Fast Fourier Transforms and it's 
Applications" by E. Oran Brigham -1988 - Figure 9.15, page 193. 

A recursive calculation of FFT weights may be implemented to save storage space 
until FFT is performed. 



exp 



2 • k • j • n • k 
N 



1= cos 



V 



N 



-J sin 



J 



N 



(15) 



10 



Equation (15) is described in 'The Fast Fourier Transforms and it's Applications" 
by E. Oran Brigham -1988 - equation 6.16, page 97. 
The first cosine and sine terms may be found using the equations below. 



Ro = cos 



' 4K ^ 



N. 



real 



Io = -sin 



where 



N 



(16) 



(17) 



N real = real FFT size 



R 0 = zero* sample of real part of exponential weight 

I 0 - zero* sample of imaginary part of exponential weight 

15 The equations to recursively compute the transform weights are given below: 



20 



R m = Ro 3 Rm-i - Io 3 I m -i 
Im - Io 3 R m _i + Ro 3 I m _i 



(18) 
(19) 



where m = 1, 2, t 



N 



real 



R m =m^ sample of real part of exponential weight 

I m = m* sample of imaginary part of exponential weight 

Since the input to the FFT is real, it may be more efficient to form a complex 
input to the FFT and then unpack the results to obtain the spectrum of the original real 



19 



PATENT 

Attorney Docket No.: 56162.000353 



signal. This may involve the computation of additional weights as shown in equations 
(20) and (21). 



R p = cos 



2-K- p 



N, 



real 



(20) 



10 



I p = sin 



where 



p = l,2- 



AT, 



real 



(21) 



N. 



real 



R p sample of cosine recombination weights 
l p = sample of sine recombination weights 

To avoid calling the sine and cosine function for each weight, the identity for 
computing the weights recursively may be derived as in equations (18) and (19) above. 

Starting with the trigonometric identities below, a recursive equation may be used 
to find the terms in (20) and (21). 

cos(A ± B) = cos(A)cos(B)T sin(A)sin(5) (22) 
sin(A± B)= sin(A)cos(5)±cos(A)sin(5) (23) 



15 R p - cos 




, 2-Mp-l) \ 

^real j 
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- sin 



2-7T ^ 
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y £y real J 
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I p = sin 
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y iy real 
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real 
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(24) 



(25) 
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This gives a recursion similar to that in equations (16), (17), (18) and (19) above. This is 
further illustrated in equations (26) and (27) below. 



Ro = cos 



'2^ 



(26) 



Io - sin 



l-K 



N, 



real 



(27) 



where 

N rea i = real FFT size 



10 Equations (18) and (19) may be modified slightly and then used with the above 

initializers to compute the new weights. 



R m = Ro 3 R m -i - Io 3 Im-i 
I m = Io 3 R m -i + Ro 3 I m .i 

15 

wherem=l,2,T 

Another embodiment of the present invention may be directed to determining an 
optimal G.SHDSL data rate using sub-band capacity. In the G.SHDSL standard, 
provisions may be made for rate negotiation to take place between two communicating 

20 modems after a line probe (LP) session. During LP, a first modem transmits a signal 
while a second modem samples a received signal to determine effects of line conditions 
on a communications channel. For example, performing a signal-to-noise-ratio (SNR) 
calculation in the time-domain gives an indication of an amount of signal power above 
that of the background noise. A problem may arise when this SNR value is used to 

25 compute the capacity of the channel since the capacity theorem, as described below, 
makes an assumption that the noise is additive white and gaussian (AWGN). When 



(28) 
(29) 
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residue echo and NEXT noise are taken into consideration, the noise is no longer AWGN. 
In fact, the noise may not have been true AWGN. 

The present invention provides an approach to rate negotiation that implements a 
geometric frequency domain approximation of channel capacity. The capacity for a 

5 plurality of M frequency sub-bands may be computed to find an overall capacity for a 
rate of interest (ROI). The sub-bands may be any segment of a total N frequency bands 
found with a discrete Fourier transform (DFT) or other method of spectrum estimation, 
such as an autoregressive (AR), moving average (MA), autoregressive-moving average 
(ARMA), quadrature mirror filter bank (QMF), and other methods. This geometric 

10 capacity may allow for a more accurate capacity calculation if the sub-band width is 
sufficiently small such that the noise within that band is approximately AWGN. 

As shown in FIG. la, during a line probe session, the G.SHDSL modem may 
determine which rates the modem may support given the line conditions. FIG. la 
illustrates the line probe (LP) session for G.HS (e.g., 114, 116) between STUR 110 and 

15 STUC 1 12, which lasts a maximum of approximately 10 seconds, for example. 

During the transmission of each probe signal, represented by Prx (e.g., Prl, Pr2, 
Pr3) and Pcx (e.g., Pel, Pc2) in FIG. la, a modem transmitting the probe may train 
associated automatic gain control (AGC) and echo canceller (EC). The modem may then 
measure the residual echo signal and use this measurement as an estimate of the noise 

20 spectrum. In addition, the modem may measure a silence power spectrum during the 
segments where no signals are transmitted and use this measurement as a noise spectrum 
estimate. The modem receiving the probe signal may measure a received signal and 
noise spectrum. One modem may transmit a probe signal at any given time. 

When the modems are not measuring received signals, the modems may calculate 

25 sub-band capacity for each rate using the methods described. 

If the SNR is calculated in the time domain, a method to compute the capacity 
may include measuring the silence power (noise), P no ise, and then the received power 
(signal + noise), P S i g nai+noise> and finding the capacity, C, using the equation below. 

30 
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C = Blog 2 



1 + - 



signal 



(T-G+y+5) 

P 10 10 

\^ noise 



( 



= Blog 2 



1+ 



SNR 
10 10 



bits 
second 



(30) 



where r represents a gap from a theoretical channel capacity for PAM signals, in dB; G 
represents a coding gain of a Trellis decoder in dB; B represents a transition bandwidth; 

5 fo represents a required margin in dB (e.g., G.SHDSL Annex B margin is approximately 
6 dB); and ca represents an implementation loss in dB. 

A problem with this estimate is not being localized in frequency. This may pose a 
problem when the noise is not AWGN. Using the sub-band capacity approach of the 
present invention, a frequency domain may be separated into narrow bands so that the 

10 signal and noise power for each sub-band may be determined. Capacity for each sub- 
band may be estimated using (30) above. According to an embodiment of the present 
invention, the noise within each sub-band may be assumed to be approximately AWGN. 
The more finely the signal spectrum is sampled, the closer the noise in each sub-band 
approximates AWGN. The geometric capacity may be computed to find a single number 

15 that may be compared between rates to determine an optimal (or best) rate of 
transmission. A simple way of separating the frequency domain into sub-bands is using 
the Discrete Fourier Transform (DFT), which is essentially a bank of band-pass filters. 

According to an embodiment of the present invention, rate selection may be based 
on a sub-band SNR calculation. FIG. 5 illustrates a flowchart for a line probe session 

20 with a sub-band capacity calculation, according to an embodiment of the present 
invention. At step 510, a line probe session is initiated. Step 512 involves preparing to 
probe R rates and step 514 involves determining, such as through comparing or 
calculation, whether i = R-l, where i represents the index for the rate. If so, an optimal 
(or best) rate is chosen based on the results of the capacity calculations for each of the 

25 probed rates. For example, the highest rate with a capacity greater than the rate itself 
may be chosen, at step 516 and line probe session is terminated at step 518. If i does not 
equal R - 1, rate i may be configured, at step 520. At step 522, a noise signal may be 
sampled. At step 524, a DFT may be computed for the noise signal. At step 526, noise 
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PSD may be estimated for the noise signal. At step 528, relevant information, such as the 
noise power for each frequency sub-band is stored for later use in computing the overall 
capacity for the given rate, for example. At step 530, a transmit signal is sampled and 
detected and step 532 involves computing a DFT for the transmit signal. Step 534 
5 involves estimating a signal and noise PSD. At step 536, capacity of frequency cells 
(e.g., sub-bands) may be computed. At step 538, summation of a plurality of frequency 
cells may be computed for a total capacity. At step 540, relevant information, such as the 
capacity calculated for the tested rate may be stored for use such as in determining the 
best rate after most or all rates have been probed. At step 514, it may determined 
10 whether i = R-l where the variable i has been increased by a predetermined value (as 
shown by i++). 

To compute the SNR based on frequency domain data, power spectrums of the 
signal and noise may be determined. This is shown in equations (29) and (30) below. 



15 F<*)=-|- |>(n) + w(n)]exp 



- 2 • k • j -k • n ^ 



N 



= S(k) + W(k) (31) 



W(k) = ^w(n)txp 



1 (32) 



Equations of (31) and (32) show N-point DFT's of the signal-plus-noise and 
20 noise, respectively. According to an embodiment of the present invention, transforms 
may be computed with the Fast Fourier Transform (FFT). The real 2N input sequence 
may be packed into an N point complex sequence as shown above. According to an 
embodiment of the present invention, transform weights may be computed recursively 
with two (or more) initialization variables using the method shown above. As mentioned 
25 previously, other methods may be implemented to estimate the power spectral density 
(PSD). 

Starting with equation (30) above, an overall capacity may be determined by 
summing capacities for each individual sub-band as shown by equation (33) below. 
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(33) 



D A. 

where B s = -, v ; 0 < < N-l; 5 5 represents a sub-band width in Hz; S(k) 

(P-a + 1) 

represents an estimated "signal only" power; r represents a gap from a theoretical 
channel capacity for PAM signals, in dB; G represents a coding gain of a Trellis decoder 
in dB; so represents a required margin in dB (e.g., G.SHDSL Annex B margin is 
approximately 6dB); c& represents an implementation loss in dB, a represents an index 
of a first sub-band and P represents an index of a last sub-band. 

The method above provides a figure of merit with which may be used to compare 
different symbol rates tested during line probing. For example, a requirement could be 
that the capacity needs to be larger than the desired data rate. 

Another embodiment of the present invention may be directed to optimizing the 
performance of non-DMT-based DSL by shaping the transmitted spectral density 
according to line conditions. The G.SHDSL and HDSL2 standards may specify the 
transmitted power to be less than a certain value at any given frequency. A plot of these 
values vs. frequency may be referred to as the "mask". Standards may also specify the 
total power to be within a certain range. Within these constraints, the shape of the 
transmitted power-spectral density (PSD) is left open to the designers. In general, the 
modem at one end of a communication connection, such as a pair of copper wires, may 
use a different transmitted PSD from the modem at the other end of the connection. 
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The present invention provides a method for choosing an optimal transmit PSD 
for a pair of modems given the line conditions (and/or other conditions). According to an 
embodiment of the present invention, line condition data may encompass power backoff 
values, as discussed in detail above. According to another embodiment of the present 

5 invention, line condition data may include data rate using sub-band capacity, as discussed 
in detail above. For example, line condition data may be based on an overall capacity 
wherein the overall capacity is determined by summing a geometric frequency domain 
approximation of channel capacity for a plurality of sub-bands, as discussed in detail 
above. Line condition data may further include a combination of power backoff values 

10 and data rate using sub-band capacity. Other forms of line condition data may be used by 
the present invention. The transmitted PSD may be chosen to minimize interference, 
such as echo and noise. Other forms of interference may also be minimized. 

This aspect of the present invention is directed to methods for shaping the 
transmitted PSD to maximize the performance of a pair of modems (e.g., non-DMT- 

15 based modems). In general, a different transmitted PSD may be used for different line 
conditions and for different data rates. In addition, this aspect of the present invention 
encompasses how the line conditions may be measured at the start of modem training. 

In the G.SHDSL and HDSL2 standards, the transmitted PSD may satisfy various 
constraints. For example, for any given frequency, the PSD may be less than a certain 

20 value where a plot of these maximum values vs. frequency is called a "mask". In another 
example, the total transmitted power may lie between a minimum and maximum value, 
where the maximum value is typically less than the power under the mask. These two 
constraints may leave some freedom to the designer to develop spectral shapes that yield 
an optimal performance for different line conditions. 

25 The power requirement for the transmit PSD is shown in equations (34) and (35) 



below. 




(34) 



\H(f]<\M(f] 



(35) 
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where H(f) represents a transmit PSD; M(f) represents a mask PSD; Pmin represents a 
minimum allowed transmit power as specified in the standard; P max represents a 
maximum allowed transmit power as specified in the standard and P mask represents an 
integral under the mask of PSD. 
5 The transmitted PSD may be chosen to minimize interference. Interference may 

include echo and noise, for example. Other forms of interference may also be 
considered. For a given modem, echo may be the part of the transmitted signal which 
leaks into the receiver or reflects back from interfaces and bridge taps. While this echo 
may be partially cancelled, there are frequencies at which the echo may be difficult to 
10 cancel. Therefore, decreasing the transmitted power of the local modem at those 
frequencies may reduce the un-canceled echo. 

For example, noise may include any unwanted signal from sources external to the 
modem. For any given frequency, the effects of noise may be reduced by increasing the 
received signal power. Received signal power may include the transmitted signal power 
15 of the remote modem after attenuation by the line connecting the two modems. 
Therefore, increasing the transmitted power of the remote modem at a particular 
frequency may reduce the effects of noise at that frequency on the local modem. This 
increase may be assumed to be within the constraints of the mask as shown in (34) above. 
Therefore, the performance of a modem may be characterized by an associated 
20 signal-to-noise ratio (SNR), which may be defined as follows: 

fM/fc(/f# \\r f (/K(fpf (36) 

where R e (f) and W(f) represent spectrums of independent random noise processes, and 
T0H c (f) are equal to a received spectrum as a function of frequency. 
25 The SNR may depend on the transmitted PSD, the channel attenuation, the echo 

power and/or the noise power, as well as other factors. For longer loops, the channel 
attenuation may be greater. The present invention enables the SNR for both modems to 
be above a certain minimum SNR. This minimum SNR may correspond to a minimum 
acceptable bit-error rate (BER). 
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Due to various factors, variations in connection or line conditions may exist. 
Terms used to describe line conditions may include high noise, short loop and long loop, 
for example. High noise may include a case where the noise floor is at a level, over any 
loop, to cause an SNR value which is close to the minimum SNR. Short loop may 
include a case where the channel attenuation of the far transmitted signal is small enough 
that, in the absence of noise, the SNR is significantly above the minimum SNR. Long 
loop may include a case where the channel attenuation of the far transmitted signal is 
large enough that, in the absence of noise, the SNR is approximately equal to the 
minimum SNR. 

Generally, modems tend not to perform well in the high-noise long-loop case. 
This means that a long-loop high-noise case is one in which the channel attenuation of 
the received signal as well as the receiver noise conditions are such that the signal-to- 
noise ratio (SNR) is not at a sufficient level to allow a bit error rate (BER) equal to or 
lower than the standard specification. Various possible scenarios exist as listed below 
where CO represents a "central office" modem and CPE represents a "customer premise 
equipment" modem. 

1. CPE low noise & CO low noise, long loop: CPE and CO may both have small 
SNR values due to channel attenuation. 

2. CPE low noise & CO high noise, short loop: CPE may have a large SNR margin 
due to low noise environment and low attenuation by the channel. CO may have a small 
SNR due to a high noise environment. 

3. CPE high noise & CO low noise short loop: CPE may have a small SNR due to a 
high noise environment and the CO may have a large SNR due to low noise environment 
along with low attenuation by the channel. 

4. CPE high noise & CO high noise, short loop: Both CO and CPE may have a high 
noise environment causing a low SNR. Channel attenuation may not be the limiting 
factor in this case. 

One aspect of the present invention addresses methods to deal with each of these 
cases by choosing specific transmit filters based on the line conditions determined during 
pre-activation. The following abbreviations will be employed: 
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1. LNLL: low-noise long-loop 

2. LNSL: low-noise short-loop 

3. HNSL: high-noise short-loop 

An embodiment of the present invention provides a method for determining line 
conditions. FIG. 6 illustrates a contour plot of the SNR versus arbitrary units of (noise + 
echo) power and arbitrary units of channel attenuation wherein 0 corresponds to the 
bottom right corner and graduates to 1 which corresponds to the upper left corner. Given 
measurements of the SNR and noise power, this plot may be used to determine the line 
conditions. The straight lines separating the quadrants may be replaced with lines of any 
shape. Based on where the SNR and noise power level falls within the plot, the line 
conditions may then be determined. In a hypothetical example presented in the plot, 
performance in all but the "high noise, long loop" quadrant may be optimized using this 
embodiment of the present invention. 

Another embodiment of the present invention relates to echo channel frequency 
response (e.g., echo spectrum). FIG. 7 is a block diagram of a G.SHDSL system's 
transmit and receive paths, according to an embodiment of the present invention. The 
dotted box encloses an echo channel 730. The echo channel 730 may not have a flat 
frequency response and may tend to pass low frequencies with less attenuation than 
higher frequencies. These low frequencies may contribute partially or significantly to the 
length (e.g., temporal) of the echo and in turn the ability of the fixed length (e.g., in time) 
digital echo canceller that may cancel the echo. 

A precoder output from precoder 710 may be an input into transmit filter 712, as 
well as digital adaptive echo canceller 722. A transmit spectrum of the near modem may 
be an output of transmit filter 712, which may then be an input to analog front end 714. 
Analog front end 714 may transmit a time varying frequency response of a digital 
adaptive echo canceller to hybrid circuit 716. Analog front end 714 may also be coupled 
to twisted pair of copper wire 718 which is communicative with a transmitter of far end 
modem 720. An output of analog front end 7 14, an output of hybrid 7 16 and a composite 
spectrum of channel, receiver, crosstalk and quantization noises may be inputs to summer 
724. The output of summer 724 may be an input to summer 726, where the output of 
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summer 724 may be summed with an output of digital adaptive echo canceller 722. The 
output of summer 726 may include an error of adaptive echo cancellation algorithm. 

As for echo, in the G.SHDSL and HDSL2 standards, both modems may share the 
same twisted pair of copper wire 718. This means that the transmit signal may leak 
through the echo channel into the receive path. The echo may be defined as the 
convolution of the transmitted signal with the echo channel. 

An echo channel 730 may include analog hardware that the transmitted signal 
passes through before it is seen in the receive path. The echo channel 730 may include an 
analog front end 714 and a hybrid 716. The hybrid 716 may act as an analog echo 
canceller to remove at least a portion of the echo. 

Residual echo may include the echo that remains after analog and digital adaptive 
echo cancellation. In some cases, it may not be possible or practical to cancel some or all 

the echo. Thus, it is desirable to make the residual echo as small as possible. 

According to an embodiment of the present invention, the signals in FIG. 7 are 

defined as follows: 

X (f ) = precoder output 

H txf (f ) = frequency response of transmit filter 

T s (f)= transmit spectrum of the near modem 

T f (f)= transmit spectrum of the far modem 

H c (f) = frequency response of the channel 

H dec (f ) = time varying frequency response of digital adaptive echo canceller 
H ec (/ ) = frequency response of the echo channel 
E(f ) = error of adaptive echo cancellation algorithm 

W{f) = composite spectrum of channel, receiver, crosstalk and quantization noises 

As shown in FIG. 7, an output of the precoder 710 may have an approximately 
flat power spectrum. Keeping this in mind while tracing the signal paths in the above 
block diagram, the following may apply: 

X(f){K= constant ( 37 ) 
Y(f) = X(f)H tx j<f)Hec(f) + T/f)H c tf) + W(f) (38) 
Z(f) = X(f)H dec (f) ( 39 ) 
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E(f) =Y(f)-Z(f) = [H,^H ec (f) - H de MW + TJf)H c (f) + W(f) 

= [H t ^(f)H ec (f) - H d ec(f)]K + T0)H c (f) + W(J) (40) 

where R e (f) is defined as [H t ^j)H ec (f) - H dec (f)]K wherein R e (f) represents residual echo 
5 spectrum, then E(f) = R e (f) + Tj<f)H c (f) + W(f). 

Looking at equations (37) through (40), the signal at the output of the summer 
726 ( £(/)) may include the received signal plus residual echo plus the total noise power 
(e.g., crosstalk + quantization noise + channel noise + receiver noise). During the initial 
training of the echo canceller, the received signal may not be present. As a result, the 
10 output of the summer 726 may become as follows in equation (41). 

E(f)= R e (f)+W(f) (41) 

FIGs. 8, 9, 10 and 11 illustrate examples of the echo channel response for two 
different G.SHDSL data rates: 2304 kbps and 192 kbps. In this example, the simulation 
may assume the modem is attached to 15kft of 26-AWG (American Wire Gauge) wire. 
15 As shown in FIGs. 8, 9, 10 and 1 1, a large fraction of the echo power may reside 

in lower frequencies, because of reduced attenuation. As shown in FIG. 9 and FIG. 11, 
which are zoomed versions of FIG. 8 and FIG. 10 at frequencies 0 to 20 kHz, 
considerably less power is attenuated at the frequencies below 20kHz. The residual echo 
spectrum is the echo channel frequency response multiplied by a transmit filter spectrum 
20 of T s {f)H ec {f) shown in FIG. 7 above. It is thus desirable to minimize residual echo 
power at the output of the summer (e.g., 726), such as R e (f) in equation (41), as much as 

possible, as described in further detail below. 

Another aspect of an embodiment of the present invention involves improved 
performance on long loops, hi the case where both the CO and CPE have a LNLL 
25 scenario, the performance of the system may be considered echo limited. With a long 
loop, the received signal may be greatly attenuated and may be considered small relative 
to the residual echo. The analog hybrid may provide some echo cancellation and the 
digital echo canceller may cancel the echo to a power level below that of the received 
signal. The resulting residual echo at the output of the summer 726 in FIG. 7 (£(/)) 
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may effectively become the noise floor of the receiver. By shaping the transmit power 
spectrum such that the low frequencies are not transmitted at a significant power level, 
the residual echo power may be reduced and therefore the reach may be extended. Reach 
may relate to the maximum length of twisted pair wire over which the modems can 

5 maintain the minimum SNR. This may be contrary to the conventional wisdom that the 
transmitted PSD should not be reduced in the low-frequency regions where the channel 
attenuation is less than in the high-frequency regions. 

As a result, if both modems transmit less power in low frequencies, the SNR may 
effectively improve. This may be considered counterintuitive in some applications since 

10 the local modem is reducing its noise floor by transmitting less energy in the lower 
frequencies, but at the same time the remote modem is also transmitting less power at 
those frequencies. A reason for SNR improvement may be attributed to nonlinearities in 
the analog portion of the echo channel. The nonlinear echo channel may produce noise at 
frequencies other than the transmitted frequency. That is, if a tone is transmitted through 

15 the echo channel at a particular power level, it may generate harmonics at higher 
frequencies. When the amplitude of the transmitted sinusoid is reduced, the power of the 
harmonics may reduce nonlinearly. The nonlinear effects occur more severely at the 
lowest frequencies, which makes it more detrimental to lower data rates. 

An example of this is shown in the FIG. 12. FIG. 12 illustrates a transmit PSD 

20 that reduces nonlinear effects at lower frequencies, according to an embodiment of the 
present invention. Comparing an overall PSD 1214 to a mask 1210, the transmitted 
power at lower frequencies may be significantly less. An upper bound may be illustrated 
by 1212. However, at the rate shown, the lower powered filter may actually perform at a 
higher SNR because the noise floor at some or all frequencies has been lowered by 

25 reducing the transmitted power at lower frequencies. There is an optimum low-frequency 
cutoff for each rate, which may be determined empirically, for example. In this example, 
peak-to-average ratio (PAR) equals 16.3 (2.9), total power is 10.3 dbm where an actual 
cutoff is at 12.0 khz. 
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Reducing the transmitted power in low frequencies may result in an increase in 
the maximum attainable line length over that obtainable with filters that may be 
optimized for noise performance. 

Another aspect of this embodiment of the present invention involves improved 
5 performance on mixed cases. For the case where one side, either the CPE or CO, 
experiences a LNSL environment and the opposite side experiences a HNSL 
environment, the noise limited side may experience an improvement in performance by 
shaping the filters on both sides differently. 

In the case where one side experiences a low noise environment and the other is 
10 in a high noise environment, as determined from comparing test results to some threshold 
value, the modems may exchange information that lets the other side know what type of 
environment it sees. Each side may then shape a transmit spectrum to benefit both itself 
and the far modem. 

FIG. 13 is an example of a flowchart for selecting a filter, according to an 
15 embodiment of the present invention. At step 1310, a CO side may determine it is in a 
LNSL situation using a proprietary line probe session. At step 1312, a CPE may 
determine it is in a HNSL situation using a proprietary line probe session. At step 1314, 
both sides may exchange information to indicate what type of environment it sees. At 
step 1316, the CPE may choose a filter that reduces transmit power in the low frequencies 
20 thereby lowering echo power, and in turn lowering its overall noise floor (due to 
nonlinearities in the echo channel). It may do this because it knows the CO is in a low- 
noise environment with a certain level of margin and may tolerate a reduction in the 
CPE's transmit power. In addition, the CPE may lower its overall transmit power across 
some or all frequencies using power back off (PBO) either alone or in addition to the high 
25 pass filtering. 

At step 1318, the CO may choose a filter that may be designed for maximum 
noise performance on the far end. It may tolerate an increase in its echo level because it 
is in a non-echo-limited case, wherein lower frequencies may not be filtered. This may 
increase the received power at these frequencies on the far end, thereby improving the 
30 CPE (which may be noise limited) SNR. At step 1320, the reduction in the CPE transmit 
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power and the increase in the CO residual echo power may be chosen judiciously so as 
not to decrease the CO's margin to an unacceptable level. 

An end result of this filter selection method is to give a customer or other user 
extra noise margin, which may be defined as the amount of noise that may be added after 

5 the modems go to steady-state while maintaining an SNR above the minimum SNR. 

FIG. 14 is a system diagram of a communication network, according to an 
embodiment of the present invention. A communication network may involve, at least, a 
CO side structure 1410 in communication with a CPE side structure 1430 via a 
communication network 1420. In particular, CO side 1410 may include modem 1412 in 

10 communication with an Analog Front End 1414. CPE side 1430 may include Analog 
Front End 1432 in communication with modem 1434. The system described in FIG. 14 
may incorporate various inventive aspects of the present invention. 

Another aspect of this embodiment of the present invention involves improved 
performance on noise limited cases. The present invention may also address cases where 

15 both sides experience a HNSL environment by using transmitted PSDs that may be 
optimized for noise. This may be in contrast to using a transmitted PSD that is a 
compromise between one that is optimized for noise and one that is optimized for reach. 

Another aspect of this embodiment of the present invention involves minimizing 
the number of required transmitted PSDs. A transmitted PSD may be stored as a set of 

20 transmit-filter coefficients. Therefore, increasing the number of transmitted PSDs may 
require more storage for these coefficients. Various factors may increase a desired 
number of transmitted PSDs. For example, for a given data rate, there may be an infinite 
number of transmitted PSDs where each one may be tailored to specific line conditions. 

According to another embodiment of the present invention, the number of 

25 transmitted PSDs may be limited to at least two where one may be optimized for noise 
performance and one may be optimized for reach performance. For line conditions which 
may be a mixture of noise and reach, an interpolation between the two may be 
implemented. In general, N PSDs may exist where interpolation among these N PSDs 
may occur. 
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In general, there may be a different set of transmitted PSD's for each data rate. 
This is because certain factors may be different for each data rate. Such factors may 
include one or more of residual-echo spectrum, the noise spectrum, and the transmitted- 
PSD mask. 

5 It may be desirable to reduce the number of transmit filters stored. A way to 

accomplish the reduction may include designing a smaller subset of filters that have 
normalized parameters. For example, if a good LNLL filter for data rate 2304 kbps has a 
cutoff of 30kHz and a good LNLL filter for 768 kbps has a cutoff of 10kHz, the same 
(similar or related) filter may be used for both rates. A digital filter may be designed to 
10 have a cutoff of 0.03866*Fs/2, for example, and that may give approximately the desired 
cutoff at both rates. In addition, a noise filter may be designed at a low rate with a small 
cutoff, which may then be used at a higher rate as a LNLL filter with a higher cutoff. 

Another embodiment of the present invention may be directed to varying 
G.SHDSL echo canceller filter lengths based on data rate. As outlined in the G.SHDSL 
1 5 standard, modems may be required to support data rates between 64kbps and 23 1 2kbps in 
8kbps increments, for example. Other requirements or recommendations may be 
implemented. When using a fixed length digital adaptive echo canceller filter to model 
an echo channel, a large number of filter taps may go unused depending on the data rate 
and/or other factors. This may occur specifically when the data rate is decreased because 
20 the time spacing between successive samples increases. In other words, the echo may use 
a fewer number of filter taps before it decays to a negligible value. 

An aspect of the present invention provides a method and system for choosing a 
reduced length filter based on data rate. A polynomial may be fit to a curve where the 
curve represents an approximate echo channel length for a complete (or other defined) 
25 range of data rates. When a data rate is chosen by a G.SHDSL modem, an echo canceller 
filter length may be chosen accordingly. This allows for reduced CPU cycles for 
performing an adaptive algorithm. Since training time is constrained to a fixed time 
duration, this allows for a longer period of training time at lower rates, thereby reaching a 
lower mean squared error (MSE) than if a full length filter were implemented. 
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G.SHDSL modems may transmit full duplex over a single twisted pair of copper 
wire. Both upstream and downstream signals may share the same (or similar) frequency 
bandwidth. For a given modem, a transmit signal may be removed from a receive signal 
with an echo canceller. The echo cancellation function may be implemented twice, once 

5 in the analog domain and once in the digital domain. An embodiment of the present 
invention is directed towards the digital echo canceller. The impulse response of the 
echo channel may be infinite. A digital adaptive filter that attempts to identify this 
channel may have a finite length. Because some designs rely on a fixed length echo 
canceller filter where the number of taps may be chosen conservatively based on 

10 experimental data to model the echo channel, perhaps for a worse case situation, there 
may be many unused taps for certain configurations. 

Typically, digital echo canceller coefficients may train to a noise floor. The noise 
floor is close to a quantization noise floor. For many cases, a large number of taps 
towards the end of the filter may have small values (e.g., close to one least significant bit 

15 (LSB) or other cutoff value, such as several LSBs) and be assumed to have little impact 
on the MSE. The point when the echo canceller coefficients decay to these small values 
(or other predetermined values), if they do, is variable and may depend on various 
conditions, such as one or more of data rate, transmit filter design, analog circuitry such 
as hybrid and line transformer, line conditions such as the placement of bridge taps, 

20 and/or other conditions. The length of an echo tail (e.g., the tail of the echo channel 
impulse response and the number of coefficients used) may depend on one or more of the 
above conditions. 

Another phenomena occurs when there is a fixed delay in time of the echo 
through the analog circuitry. This may cause a section of taps at the beginning of the 
25 filter to take on small values close to a quantization noise floor. In this case, it may be 
desirable to have a programmable hardware delay that adjusts for this analog delay. 
Since the delay time may be fixed, a number of digital echo canceller taps that may be 
implemented to adjust for the delay time may vary with data rate. Similarly, the 
programmable delay may vary. Typically, the number of taps that may be removed from 
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the tail may be much larger than those that may be removed from the start of the filter, 
for example. 

For higher data rates, a transmit filter may be designed to transmit less energy at 
lower frequencies and more power at higher frequencies, within the constraints of the 
standard, such that the echo tail is shorter. This may allow for the use of a shorter echo 
canceller filter, thereby saving hardware capacity as well as operating power. 

If the echo canceller filter is designed in hardware, further savings in power 
consumption may be made at some or all rates. The hardware may be designed such that 
banks of filter coefficients may be shut off when they are not being used, thereby 
reducing overall power consumed by the processor since no or minimal multiplications 
(or other operations or calculations) may be performed on those coefficients. 

FIGs. 15 and 16 illustrate low and high rate echo channel impulse responses for a 
digital echo canceller, according to an embodiment of the present invention. 

FIG. 15 is a chart illustrating echo canceller filter taps for a specified rate, 
according to an embodiment of the present invention. More specifically, FIG. 15 
illustrates echo canceller filter taps for the rate 192 kbps. Coefficient magnitude in dB is 
illustrated with respect to coefficient number or filter taps. In this example, digital echo 
canceller coefficients may be at 192 kbps with 5500m PE04 twisted pair channel. As can 
be seen on the plot, beyond approximately tap 75, the filter coefficients may be small 
enough to be discarded as these coefficients approach the 16-bit noise floor, of 
approximately -90.31 dB. 

In another example, digital echo canceller coefficients may be at 1544 kbps with 
4000m PE04 twisted pair channel. FIG. 16 is a chart illustrating echo canceller filter taps 
for 1544 kbps. It can be seen from the plot that the entire filter length may be considered 
to accurately model the echo channel. 

FIGs. 17 and 18 illustrate various ways to adjust the number of digital echo 
canceller filter taps based on data rate, according to further embodiments of the present 
invention. 

As shown in FIG. 17, an approximate echo channel length may be measured at 
various data rates. A polynomial may then be fit to the resulting curve. When a data rate 
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is chosen, the rate may be an input to the polynomial function and the output may be the 
approximate number of filter taps that may be used to model the channel for the given 
rate. This may be used during an EC training stage to determine the number of taps that 
may be implemented for the configured data rate. 
5 As shown in FIG. 18, another method may involve plotting an echo channel 

length at a select number of frequencies versus data rate. A linear curve may be drawn 
above some or all the points. This provides a simple linear scaling of the filter. 

Another method for choosing the echo canceller length may involve performing 
an analysis of the echo canceller coefficients after the training session that takes place 
10 during the optional line probe session, as described. The echo canceller may be trained to 
determine the spectrum of the residual echo, which becomes the noise floor of the 
system. This information is then used to determine rate and power back off as well as 
spectral shaping. The echo canceller is trained again at a later time, for a larger number 
of training symbols, but the coefficients generated during the abbreviated training session 
15 that takes place during line probe may be used to determine more exactly the optimum 
length echo canceller for that particular modem. 

A simple way of determining which point to truncate the echo canceller filter may 
involve a mean filter. This is illustrated in the equation (42) below. 



20 v*=^i>*-,- *=W-m-i W 

1" 1=0 

where M represents a maximum length in taps of an echo canceller filter and w represents 
the filter coefficients. 

The output of this filter may be observed, and when it drops below a 
predetermined threshold T, the remaining taps are assumed to be unnecessary. The 
25 threshold may be chosen to correspond to a particular dB value. For example, if the filter 
coefficient is a 16-bit fixed-point quantity and a coefficient > 80 dB below the peak is 
considered irrelevant, the following may be used to determine the threshold. 
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201og 10 -| 




(43) 



-80 



where T = 2 l5 10 20 =3.2768 

The output of the mean filter is compared to the threshold and when it is consistently 
5 below the threshold, those taps may be ignored. In another example, if a processor uses 
24 bit precision, 2 23 may be used, instead of 2 15 in the above equation (43). In this 
example, since a quantization noise floor is approximately 138 dB, a cutoff point of 
approximately 120 dB may be selected. The cutoff point is a point at which a coefficient 
is determined to have a minimal effect on the fdter output. 
10 There are various advantages to this method of the present invention. First, it may 

not require an experimental determination of the filter length curve as shown in FIG. 17, 
for example. Second, the worst case need not be chosen all the time. The filter length 
may be chosen based on the line conditions, which may be different for each modem. 



15 canceller filter length for a particular one or more rates. These factors may be considered 
on a subset of the total possible rates such that sufficient information may be available to 
estimate the curves in the figures mentioned above. 

For example, a transmit signal may experience a delay as the signal leaks through 
to a receive path. As a result, the number of taps that may be used to model this delay 

20 may vary based on rate because the time spacing between successive samples may be 
inversely proportional to the rate. It may be desirable to compensate for this delay and 
for the delay offset to vary with rate. By ensuring the first several taps of the filter are 
used, taps from the filter tail may be removed. In the case of higher rates (e.g., where the 
echo tail may occupy some or all of the available taps), the usage of the available taps 

25 may be improved to increase performance. 

Another factor may involve measuring an echo response for a case where the 
noise floor is very close to a quantization noise floor. This allows for a more accurate 
determination of where the filter taps decay to small enough values to discard. For cases 



The following factors may be considered when determining a maximum echo 
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where the noise floor is higher, the system may not be echo limited anyway. As a result, 
the reduced number of taps will not harm performance. 

Another factor may involve measuring the echo response for assumed worse case 
bridge tap conditions. After the factors above have been investigated for a given rate, the 
5 number of taps used in the echo canceller filter may be determined for that rate. 

The following discussion describes a block-adaptive LMS echo cancellation 
algorithm and various implementations thereof. One purpose of an echo canceller may 
include removing an image of a transmit signal that leaks through a hybrid and mixes 
with a received signal. This may occur because the transmit and receive signals share the 
10 same twisted pair transmission line. The echo canceller may include a hardware module 
that performs an adaptive weight update and filters the transmit signal to generate an echo 
model which may then be subtracted from the received signal. Routines, modules, 
instructions, etc. may be employed to select or generate certain parameters that effect 
system performance. 

15 FIG. 19 is a block diagram illustrating basic operations of an echo canceller, 

according to an embodiment of the present invention. A transmit signal may have an 
effective sampling rate increase by a factor of two, for example, enabling the echo 
canceller filter to also act as an interpolating filter. 

Tomlinson Precoder (TP) 1910 generates signal x(n), where at least a portion of 

20 signal x(n) is received by transmit filter (TXF) 1918. Other types of precoders and/or 
filters may be implemented. The output of TP 1910 is further coupled to delay d2 1912 
and LMS algorithm 1916, wherein x(n) may represent a T- spaced output of a Tomlinson 
precoder wherein T represents the reciprocal of the symbol rate. Filter 1918 outputs a 
signal (e.g, a T/4-spaced output), which may be received by digital-to-analog converter 

25 1920. An analog version of the signal may be received by echo canceller and analog 
front end block 1922. Subtractor 1924 subtracts an estimate of the echo from the receive 
signal. Signal s(n) + n x , which represents the receive signal plus channel noise, may be 
received by delay d3 1926, which may be further coupled to analog-to-digital converter 
1928. A resulting signal T/4 may be received by block 1930, where decimation M is 

30 defined as 2, or other predetermined value. The resulting signal d(n) (e.g., a T/2 spaced 
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output) may be subtracted from signal y(n) (e.g., a T/2 spaced output) by summer 1936 
and inputted to delay dl 1934 for generating signal e(n) (e.g., a T/2 spaced output). The 
error signal e(n) may be received by LMS algorithm 1916 for generating signal w(n), 
which may be received by echo canceller filter 1914. Echo canceller filter 1914 is 

5 coupled to delay d2 1912 at an input and generates signal y(n). 

A block-adaptive LMS algorithm 1916 periodically updates a weight vector. The 
echo canceller may be fractionally spaced, where the sample rate at an output is different 
than at an input. In a particular implementation, the echo canceller may implement an 
interpolation rate of 2, for example. Other interpolation rates may be implemented. The 

10 weight vector may contain a set of coefficients of a Finite Impulse Response (FIR) filter, 
which may be implemented in hardware. A software algorithm may write the updated 
coefficients to a memory that may be read by a hardware-based filtering block. 

Another example may involve a fractionally spaced block LMS adaptive filter. 
An entire block LMS algorithm may be implemented in software where the algorithm 

15 may be fractionally spaced. Gear shifting may also be used to optimize the 
convergence/Minimum Mean Squared Error (MMSE) tradeoff, as further described 
below. A gear may be a particular value of the step size \l, shown in equation (44) below. 
An example of a weight update equation for the standard LMS algorithm is shown in 
equation (44) below 

20 

w(n+l) = w(n) + ^L[e(n)x(n)] (44) 
where the error signal is given by equation (46) as seen in FIG. 19. 



25 e(n) = d(n) - w T (n)x(n) 

where 

d (n)is the desired response at time index 'n' ; 
w(n)is the weight vector at time index 'n'; 
x(n)is the input vector at time index 'n'; and 
e(n)is the error at time index 'n'. 



(45) 
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and 

E((Q) =D((0)-W(0))X((O) 

= H(Gj)X((d) + S(co) + N X (G>) - W(0))X(C0) 
5 = [H((o) - W(co)]X((Q) + S(a>) + N x ((o) (46) 

where 

E(o))is the spectrum of the error; 

Z)(t»)is the spectrum of the desired response; 

W(co)is the frequency response of the adapted filter; 

X ((o)is the spectrum of the input signal; 

S (to) is the spectrum of the received signal; and 

N x (co)is the spectrum of the noise. 



As shown in equation (46), H(co) may represent the frequency response of the 
echo-channel (e.g., composite transmit filter, echo channel and analog circuitry). 

10 Equation (46) shows that as the frequency response of the adapted filter coefficients 
represented by W(a>) more closely matches the frequency response of the echo-channel, 
the error signal may be dominated by the received signal plus some, N x ((o). 

Since the update of the weights in hardware may be accomplished while the 
hardware block is operating in real time, the weights may be updated periodically by 

15 averaging a weight vector over N blocks and then writing the averaged weight vector to 
hardware. This means for every N weight vectors that are updated by the software 
algorithm, one weight vector may be written to hardware. The averaging of the weight 
vector may be shown in equation (47) below. 



20 



w(n + 1) = w(n)+ il[e{n)x{n)] 

w(n + 2) = w(n + 1)+ jl [e{n + l)x(n + 1)] 

w(n + N)= w(n + N - 1)+ \i [e(n + N - l)x(n + N - 1)] 



(47) 
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By substituting the first line of (47) in the second, and the second into the third 
and so on, (47) becomes equivalent to equations (48) through (52) shown below. 



10 



w(n + N) = w(n)+ fi^ [e(n + i)x(n + i)] 



i=0 



5 or more compactly 

w(n + N)= w(*)+ /j[x(n)e(fi)] 
where 

X(n)=[x(n) x(/i + l) - x(« + 7V-l)] 
e(n + l) 



e(»)= 



x(«)= 



x(n-l) 



(48) 

(49) 
(50) 

(51) 



(52) 



The above equations (48) through (52) may be referred to as the block-LMS 
algorithm, as illustrated by 1916 in FIG. 19, where the addition of the gradient estimate 
15 may be saved to the weight vector at each iteration. Instead, the weight vector may be 
updated at every N iterations, for example. The error signal in equation (47) and 
equations (48) through (52) is no longer the same as (44) since the weight vector is not 
updated at every sample. The N samples of the error signal may come from the weight 
vector at time index n. 

20 Since the algorithm may also be fractionally spaced with an interpolation by 2, for 

example, the delays shown in block diagram FIG. 19 may be incorporated. As a result, 
equations (48) through (52) may be modified slightly as shown in equations (53) through 
(58) below. 
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N-l 



i=0 



w(n + N) = w(n)+ ju(n)£ + i)x(n + *)] 
becomes 

w evert (n + AT) = 2-{ w* v ™ (*)+ ji(n)£ [e(2(i + d 2 )+ (1,+ w)x(n + i)] 



AM 



i=0 
AM 



5 w"** (n + iV) = 2" w 0 ^ (»)+ ju(n)£ [e(2(i + d 2 )+ n + l)x(n + i)] 



i=0 



x{n) 
x(n - 1) 

x(n-127) 



(48) 



(53) 
(54) 

(55) 



w(n)= 



w 0 (») 
w,(n) 



(56) 



10 w eve "(n)= 



w 0 (n) 
w 2 (n) 



(57) 



vt oid {n) = 



w 



^255 M. 



(58) 



Where "i" shown in equations (53) through (58), equation (59) and equations (60) and 
15 (61) below, represents the scale adjustment made when \wj(n)\ > = for any j and a 
particular time instant n, and "shift" is the programmable shift value in the ECF. The 
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variable "i" may take on values of 0 or 1 and the constant "shift" may be decremented 
every time i=l, or at the occurrence of another condition. 

The even and odd coefficient vectors shown above may form a polyphase filter 
that interpolates the input data by a factor of two, for example. Other factors may be 

5 implemented. The step value (n) may be time-varying to account for the periodic gear- 
shifting that takes place. The indices dl (as shown by 1934 in FIG. 19) and d2 (as shown 
by 1912 at FIG. 19) may account for the delays shown in the block diagram of FIG. 19. 

A DC offset estimate may also be used as an input to the hardware block. The 
DC term may be updated during the initial training while gear shifting is in effect. This 

10 computation is shown below. 



i 1023 

dc =— S>(i) (59) 
1024^ w 

Gear shifting may be implemented to make the convergence of the algorithm 
15 occur more quickly. The excess mean squared error (EMSE) may be proportional to the 
step size. However, the number of samples that may be used to converge may be 
inversely proportional to the step size. Therefore, the desire may be to balance the 
convergence speed with the excess mean squared error. 

Gear shifting may involve using different values of a scale factor jd at different 
20 points in the echo canceller training. When the initial training starts, a goal may be to 
traverse as much of the error performance surface as possible towards the goal of the 
minimum mean squared error. The following describes an example of a gear shifting 
procedure. 

1. Start with a large step size that allows a fast convergence to same EMSE value. 
25 2. After the desired EMSE is obtained, switch to a new smaller step size, which allows 
slower convergence to a new smaller EMSE. 
3. Repeat step 2 until the final desired EMSE is achieved. 

The optimum shift points may be determined by software. The following table 
lists the gear-shift point in samples and the right shift (e.g., power of two) division of the 
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weights. These gears may be used in the initial training. While in steady state, a single 
gear may be used and may be approximately Vi the smallest <D in the table. 



Gear# 0 1 

5 Samples 2000 598 

Right 3 4 
Shift 



2 3 4 5 
1427 3188 7241 15000 
5 6 7 8 



The hardware portion of the echo canceller may perform a linear discrete time 
10 convolution as described by equations (60) and (61) below. 

y {in) = 2" ( **" ,) t T (n - d 2 V ™ (n)J- dc (60) 
y{2n + 1) = 2- (l **-' V(n - d 2 )sv odd (n)J- <fc (61) 

The hardware may also compute the error signal, which may be used to adapt the 
15 software LMS update algorithm, as shown in equations (62) and (63). 

e(2n) = d(2n)- 2" toM x T (n - d 2 )w even (n) (62) 
e{2n + 1) = </(2n + l)- 2~ te -'V(/z - d 2 V"(/i) (63) 

where d(n) is the T/2 - spaced output of the RNRF filter and x(n) is the T-spaced 

20 output of the Tomlinson precoder, for example. 

FIG. 20 is a flowchart illustrating a software algorithm, according to an 
embodiment of the present invention where k is the coefficient index. The LMS 
algorithm may have various inputs, outputs, and storage factors. For example, storage 
factors may include an array of 256 64-bit locations to store the weight vector. The 

25 elements of this array may be initialized to zero. An input may include a step size 
parameter fi for determining the amount of right shift to perform on the averaged 
elements of the gradient vector. This may be changed by software at any time. Other 
inputs may include ECF delay; FIFO delay; number of symbols to process (e.g., N2*N1); 
number of gradient vectors to average, N2, before a weight vector is written to the ECF; 
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Tomlinson precoder output, x(n\ which may be a 16-bit number; and/or ECS (or AGC) 
output, e(ri). Other inputs may be provided. An output of the LMS algorithm may 
include weight vector, w(n). The upper 16 bits of the high precision coefficient values 
may be written to the ECF portion of the hardware block. 

5 As shown in FIG. 20, step 2010 indicates a start of the software algorithm. At 

step 2012, variables i, k, and j may be initialized to zero, where i is a block LMS index, k 
is a coefficient counter and j is a symbol counter. At step 2014, a correlation, as shown 
in the summation of equation (48) of variable i may be performed. At step 2016, a 
comparison or other process is performed to determine whether variable i is equal to N2 - 

10 1, where N2 indicates the number of gradient vectors to average. If variable i is equal to 
N2 - 1, the sum is shifted by a step size and the coefficient k is updated at step 2018, as 
also shown in the update of the weight vector of equation (48). Otherwise, variable i may 
be adjusted by a predetermined value (as shown by i++) and the correlation of i is again 
performed at step 2014. At step 2020, a comparison or other process is performed to 

15 determine whether k is equal to a predetermined constant, such as 255, wherein k may 
include a coefficient index. If it is determined that k is equal to 255 at step 2020, k is 
initialized (e.g., k is made equal to zero). At step 2022, it may be determined whether j is 
equal to Nl - 1, where Nl is the total number of symbols used for the current training 
session. If so, step 2024 indicates the end of the software algorithm. Otherwise, variable 

20 j may be adjusted by a predetermined value (as shown by j++) and correlation i may be 
performed at step 2014. If k does not equal 255 at step 2020, k may be adjusted by a 
predetermined value (as shown by k++) and i may be initialized (e.g., i is made equal to 
zero) where correlation of i may be performed, at step 2014. 

For example, a file blk__lmsupd_a.mip may contain the assembly code to perform 

25 the averaging of the gradient vector and the final coefficient update as well as overflow 
detection, in accordance with equation (48) above. The function may have the following 
inputs, 

blk _Jmsupd2( pointer to 64-bit coefficients, 
pointer to 16-bit input data, 
30 pointer to even error sample, 
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number of coefficients, 
step value, 

number of samples per block) 
The function may automatically update the coefficients and return an overflow 
5 flag. In addition, the hardware setup may involve the setting of control registers. 

Another embodiment of the present invention may be directed to determining an 
efficient estimation of a base-2 logarithm of a number. According to an embodiment of 
the present invention, an estimation of the log base-2 of a fixed-point binary number in 
either hardware or software may be implemented using a minimal number of parameters. 
10 Specifically, a single 2 nd order or greater polynomial may be sufficient to cover an entire 
range of input values for any arbitrary input word precision. Applications may include 
calculating a signal to noise ratio (SNR), bit error rate (BER), power in dB and any other 
application involving the calculation of a logarithm to any base. This aspect of the 
present invention may apply to ADSL, DSL, G.SHDSL and other types of 
15 communication. Further, this embodiment of the present invention may be applied at a 
CO, a CPE, via wireless transmission and other types of applications. 

The present invention may be implemented to estimate the base-2 logarithm of a 
number x, as shown in equation (64). 

JC = 2*=2 fc+ *' ) = 2*2* / (64) 
20 As seen in (64), the logarithm (exponent) k is the sum of an integer part, k t , and 

a fractional part (less than 1), k f9 as shown in equation (65). 

k = k t +k f (65) 

A rough approximation of the base-2 logarithm may be found using equation (66), 
which performs an exact calculation of the integer part and a linear approximation to the 
25 fraction. 

log 2 (x) = k t + k f = k t + ^ - 1 (66) 
where 

x 
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According to an embodiment of the present invention, a linear approximation of 

-A 

the fraction k f may be inserted into a polynomial function to estimate the fractional part 
of the log. 

The fractional part of the log may have the same (or similar) functional shape for 
5 some or all numbers. As a result, a single polynomial may be used for an entire range (or 
other specified range) of input values, as shown in FIGs. 21 and 22. 

When observed on a linear scale as in FIG. 21, the fractional portion of the log 
appears to have a different shape throughout the range of numbers. When the same is 
plotted with a log2 scale in the x-dimension as in FIG. 22, the shape of the function 
10 representing the fraction part of the log is the same (or similar) for most or all numbers, 
indicating that a single polynomial may be found. For example, the function representing 
the fractional part of the log between 2 12 - 2 13 is the function of the fractional part of the 
log between 2 13 ~2 14 , only decimated, which means it is equal to every other sample of 
the sequence spanning 2 13 -2 14 , for example. Likewise, the function of the fractional 
15 part is an up-sampled version of the function between 2 n - 2 12 . 

An example of a polynomial approximation to the fractional part is shown in 
equation (67). 

k f =k f =%a,(k f ) (67) 

1=0 

a x is thei* polynomial coefficient 
AT is the order of the polynomial 

20 The polynomial may be determined using any curve fitting method for outputting 

coefficients. For example, the higher the order of the polynomial, the better the 
approximation to the fractional part of the log. Experiments have shown an accurate 
estimate with a 2 nd or 3 rd order polynomial. According to an embodiment of the present 
invention, the final log may be shown below in equation (68). 

25 k = k t +k f =fc f + X fl A*/J ( 68 ) 

r=0 
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The result of equation (68) is an approximate log 2 of an integer number. If a 
numbering system assumes samples are fractional numbers between [- l,l) , then the log 
may be found by, 

logf* =fc (») 

5 Logarithms to any base may be found by multiplying (68) by a constant using 

various log conversion methods. 

The following plots (FIGs. 23-26) illustrate errors of different approximations in 
terms of dB, according to the present invention. The errors may be taken as the 
difference between the log base-2 using the matlab log2() function and the estimate of the 

10 log base-2, both converted to dB. 

FIG. 23 illustrates a comparison of the error in the linear approximation 2310, full 
matlab precision floating-point polynomial approximation 2312 and the reduced 
precision fixed-point polynomial approximation 2314. In this case, 8-bit polynomial 
coefficients and 8-bit polynomial inputs were used with a 2 nd order polynomial, 

15 producing a 16-bit result. 

FIG. 24 illustrates a comparison of the error in the linear approximation 2410, full 
matlab precision floating-point polynomial approximation 2412 and the reduced 
precision fixed-point polynomial approximation 2414. In this case, 8-bit polynomial 
coefficients and 8-bit polynomial inputs were used with a 3 rd order polynomial, 

20 producing a 16-bit result. 

FIG. 25 illustrates a comparison of the error in the linear approximation 2510, full 
matlab precision floating-point polynomial approximation 2512 and the reduced 
precision fixed-point polynomial approximation 2514, where 2512 and 2514 essentially 
overlap. In this case, 16-bit polynomial coefficients and 16-bit polynomial inputs were 

25 used with a 2 nd order polynomial, producing a 32-bit result. 

FIG. 26 illustrates a comparison of the error in the linear approximation 2610, full 
matlab precision floating-point polynomial approximation 2612 and the reduced 
precision fixed-point polynomial approximation 2614, where 2612 and 2614 essentially 
overlap. In this case, 16-bit polynomial coefficients and 16-bit polynomial inputs were 

30 used with a 3 rd order polynomial, producing a 32-bit result. 
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FIGs. 27 and 28 illustrate hardware implementation details, according to an 
embodiment of the present invention. The digital circuit of the present invention may be 
composed of at least two portions. The first portion may be implemented to determine an 

integer part k t and an estimate of the fractional part k f . The second portion may be used 

5 to implement the second order polynomial for a better estimate of the fractional part. 
Other higher order polynomials may also be implemented. 

FIG. 27 illustrates a block diagram of a digital circuit for generating an integer 
part and an estimate of fractional part of log-2, according to an embodiment of the 
present invention. When a new valid input data becomes available, as may happen when 
10 the data is written to a data bus at periodic intervals determined by a clock, for example, 
it may be loaded into a shift register 2710. At the same (or near the same) time, a counter 
13 2720 may be loaded with the total number of bits in the input data. If the most significant 

|S bit (MSB) of the shift register 2710 is zero, the data in the shift register 2710 may be left- 

shifted by one bit and the counter 2720 may be decremented by one. This process may 

ill 15 repeat until the MSB of the shift register 2710 is one. Then, the counter output may 

ill 

contain the integer part ^and the MSB's of the shift register 2710 may become the 
estimate of the fractional part k f . Other variations may be implemented. 

In FIG. 28 illustrates a block diagram of a digital circuit for implementing a second 

:f s order polynomial, according to an embodiment of the present invention. The second 

20 order polynomial may be implemented with simple hardware components (e.g., squaring 
device 2810, constant multipliers 2812, 2814 and adders 2816, 2818). Constant 
multipliers 2812, 2814 may be built with one or more shifters and/or adders. In an 
example, the circuit may not require any real variable-operand multipliers. Thus, the 
algorithm may be efficiently implemented with simple hardware. 
25 A squaring circuit 2810 (or other circuit corresponding to an order of the 

polynomial) may receive an estimate of a fractional part and generate a function of the 
estimate. A constant multiplier 2814 may receive the estimate of the fractional part and a 
second polynomial coefficient and generate a first output. A second constant multiplier 
2812 may receive the function of the estimate of squaring circuit 2810 and a third 
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polynomial coefficient and generate a second output. The first output of constant 
multiplier 2814 and the second output of second constant multiplier 2812 may be 
summed by adder 2816 for generating a first sum. A second adder 2818 may receive the 
first sum and a first polynomial coefficient for generating an improved estimate of the 
5 fractional part. 

The following routine is an example of an un-optimized assembly language 
implementation used to achieve one aspect of the present invention. The target processor 
in this case is a 32-bit fixed point MIPS KC4. A 2 nd order polynomial is used in this 
example. 

10 

#define COEFF0_H 0x0126 
[J #define COEFF1JH 0x548c 

0 #defme COEFF2 JI 0xd4e0 

r: » : 

n #define MULTLADJUST 1 

! n 15 #define BITS_PER_WORD 0x20 

ly 

1 3 .align 4 

J* log2_32: /* usage: Iog2_32(a0 = input__word , al = number_of_fractionaLbits) 



clz 


v0,a0 


/* counts leading zeros */ 


move 


t9,al 


/* load resolution value to t9 */ 


ori 


t8,r0XBITSJ>ERJWORD-l) 


sub 


vl,t8,v0 


/* finds integer exponent */ 


ori 


al,r0,0xl 


/* load a 1 into upper half word 


sll 


al,al,vl 


/* shift to msb position */ 


xor 


a0,a0,al 


/* mask out MSB */ 


sll 


a0,a0,v0 


/* left align remaining bits */ 


lui 


t0,COEFF0_H 


/* load first coeff */ 


lui 


a3,COEFFl_H 


/* load second coeff */ 


mult 


a3,a0 


/* peform first multiply */ 


mfhi 


a2 


/* result of first multiply */ 
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sll a2,a2,(MULTI_ADJUST) /* adjust for signed multiplication */ 



move 


t7,a2 




add 


t0,a2 


/* add to previous result */ 


lui 


a3,COEFF2_H 


/* load next coefficient */ 


mult 


a0,a0 


/* square sample */ 


mfhi 


a2 


/* load sample A 2 */ 


sll 


a2,a2,(MULTI_ADJUST) /* adjust for signed multiplication */ 


mult 


a2,a3 


/* multiply by coefficient */ 


mfhi 


a2 


/* load result */ 


sll 


a2,a2,(MULTI_ADJUST) /* adjust for signed multiplication */ 


add 


t0>a2 


/* accumulate to result */ 


add 


t0,t7 




S11V 


vl,vl,t9 


/* left align */ 


sub 


t9,t8,t9 


/* partial result */ 


srl 


t0,t0,t9 


/* right align */ 


or 


v0,vl,t0 


/* merge results */ 


jr 


ra 




nop 







20 Virata Corporation's Aluminum™ DSL PHY is designed for full duplex 

symmetric transmission over ordinary single twisted copper pair when used, for instance, 
with the Aluminum Analog Front End (AFE). This chipset supports programmable data 
rates ranging from 192Kbps to 4.6Mbps on a single pair, and provides reach greater than 
18,000 feet at 1.5Mbps. Loop-lengths of up to 26,000 ft are supported at lower data 

25 rates. 

The Aluminum™ chipset includes digital communications subsystems, which 
may include a combination of echo canceller, pre-coder, feed forward equalizer and 
decision feedback equalizer. Virata also offers its customers a comprehensive suite of 
technology solutions available to aid in the design, development and deployment of 
30 symmetric DSL products. This includes the BD3801 development reference platform for 



53 



PATENT 

Attorney Docket No.: 56162.000353 



the Aluminum™ chipset. Aluminum™ and the Aluminum™ AFE are 
HDSL2/G.shdsl/2B1Q Synchronous Digital Subscriber Line (SDSL) compliant. 

The Aluminum™ chipset provides customers with a data throughput increase of 
up to 100 percent over competitive G.shdsl solutions and enables the development of 

5 symmetric DSL products with lower power consumption, greater reach and higher 
performance than was previously possible. Aluminum™ and Aluminum™ AFE may be 
purchased as a bundle with Virata's Helium™ communications processor and 
comprehensive network protocol stack, creating a complete customer premises equipment 
solution for symmetric DSL gateways, routers, and integrated access devices (IAD). In 

10 support of this symmetric DSL chipset and software solution, Virata is also delivering a 
symmetric DSL to Ethernet router reference design. 

Helium™ is a low-cost, Physical Layer Device (PHY)-neutral communications 
processor that enables high-speed Internet access capability for single- and multiple-user 
endpoint devices such as Universal Serial Bus (USB) modems, home gateway devices 

15 and small office/home office (SOHO) routers. The Helium™ chip may be fully 
integrated with a networking and protocol software suite that handles Asynchronous 
Transfer Mode (ATM), frame, routing, bridging and signaling functions, as well as 
Simple Network Management Protocol (SNMP) management. 

FIG. 29 is a schematic diagram of a hardware architecture in which the inventive 

20 aspects of the present invention may be incorporated. The inventive concepts discussed 
above may be achieved with the processing aid of Million Instructions per Second 
(MIPS) 2910 shown in FIG. 29. The inventive concepts discussed above may be 
incorporated into chip sets, such as Virata Corporation's Aluminum™ 200 or 204 DSL 
Processor, which is also known as Virata's second generation symmetric high-speed DSL 

25 processor. Aluminum™ 200 or 204 may support several modes of operation including: 
International Telecommunications Union (ITU) G.991.2 (G.shdsl), American National 
Standard Institute (ANSI) T1E1.4 (High Speed Digital Subscriber Line (HDSL) 2) and 
single-pair 2B1Q SDSL. A solution for customer premises G.shdsl equipment, 
Aluminum™ 200 or 204 provides compliance with the ITU G.991.2 standard, as well as 

30 other standards. Data rates from 192 Kbps to 2.3 Mbps are supported on 8 Kbps 
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boundaries. In addition, Aluminum™ 200 or 204 provides at least three additional base 
data rates above 2.3 Mbps: 3.096 Mbps, 4.104 Mbps and 4.616 Mbps. 

The power spectral density (PSD) of the transmitted signal is programmable and 
supports defined symmetric and asymmetric PSDs. In addition, the adaptability of the 

5 PSD shaper may allow support of new PSDs that may be defined in the future. 

The Aluminum™ 200 or 204 DSL Processor may support Transmission Protocol 
Specific - Transmission Convergence (TPS-TC) defined in G.991.2 including dual-bearer 
mode. By providing at least two independent serial channels, in addition to a UTOPIA 
Level 2 port, the Aluminum™ 200 or 204 may provide services such as simultaneous 

10 Synchronous Transfer Mode (STM) voice and ATM data transport. 

Support for analog voice and G.shdsl on the same copper pair is provided through 
the Aluminum™ 200's or 204 ? s G.shdsl-over-POTS mode. By using this capability, 
vendors and other entities may deliver the reach and symmetric performance of G.shdsl 
without giving up POTS service. G.shdsl-over-POTS further works with current ADSL 

15 splitters and microfilters, as well as other devices and components. 

The Aluminum™ 200 or 204 DSL Processor may work in conjunction with 
Virata's Aluminum™ 200 Smart Analog Front End/Line Driver device, for example. 
The Aluminum™ 200 or 204 may control the Aluminum™ 200 AFE through a digital 
serial bus and may further provide for parameter calibration, power cutback and other 

20 functions. This configurability of the present invention allows the AFE to better match 
line conditions for higher performance and greater reach. 

The BD3802 is a development platform for Aluminum™ 200 or 204 DSL 
Chipset, providing a comprehensive set of hardware and firmware tools to assist users in 
rapid development and deployment of products and services. 

25 Product applications may include Symmetric DSL routers and Integrated Access 

Devices (IAD); DSL Access Multiplexers (DSLAMs); Multi-tenant and Multi-dwelling 
unit networks; Tl/El distribution products; and Tl/El pairgain systems (using 3 Mbps 
and higher data rates). 

Specification details may include ITU G.991.2 (G.shdsl) compliant; T1E1.4 

30 HDSL2 compliant; ETSL ETR-152 compliant (single pair); support for data rates, 
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presently, from 192 Kbps to 4616 Kbps on 8 Kbps increments; programmable framer 
supports G.shdsl, HDSL2, European Telecommunications Standard Institute (ETSI) 
SDSL, HDSL and transparent framing; UTOPIA Level 2 interface for ATM data and two 
independent serial interfaces for STM data; and 8-bit multiplexed or non-multiplexed 
5 host bus to connect to a variety of host Central Processing Units (CPUs). 

As shown in FIG. 29, MIPS Reduced Instruction Set Computing (RISC) engine 
and control registers 2910 may be coupled to a host interface 2922, which may in turn be 
coupled to a 8-bit host interface. A serial data input may be coupled to an input of Tx 
Framer TPS-TC 2912, which is further coupled to a Trellis Encoder/Mapper 2914. 
10 Trellis Encoder/Mapper 2914 may provide an input to Precoder 2916 where Precoder 
2916 may be coupled to a Tx Filter 2918. Tx Filter 2918 may be coupled to (sigma- 
IZ delta) S-D interpolating filter 2920 which is coupled to an output, Tx out. Utopia -2 

f J Interface 2924 may be coupled to an input of Tx Framer TPS-TC 2912 and may further 

Q receive data from a Rx Framer TPS-TC 2926. S-D Decimation filter 2938 receives an 

HI 15 input from Rx In and is coupled to an adder 2940. Adder 2930 sums outputs from filter 
fU 2938 and echo canceller 2932 and generates an output to Feed Forward (FF) equalizer 

p 2930. FF equalizer 2930 may be coupled to a Timing Recovery Phase Locked Loop 

[7 (PLL) 2936 at an input. At an output, Timing Recover PLL 2936 may be coupled to Volt 

ss=s 

10 Controlled Oscillator Digital Analog Converter (VCXO DAC), which in the case of the 

l n 20 STUR is used to adjust the sampling phase to match that of the transmitting modem. FF 
equalizer 2930 may be coupled to a Trellis Decoder 2928, which may be in turn coupled 
to Rx Framer TPS-TC 2926. In addition, in the case of the Aluminum™ 204, a SNR 
margin 2942 may be coupled to FF equalizer 2930 and Rx Framer TPS-TC 2926, which 
may be further coupled to a Serial Data output. 
25 In particular, Aluminum™ 204 may include SNR margin 2942, as discussed 

above and as shown in FIG. 29. Details of SNR margin 2942 are shown in FIG. 30. A 
purpose of a SNR-margin estimator may include determining SNR for a given 
constellation and input signal. The output of the estimator may include the SNR margin 
value in dB. The SNR-margin estimator may include a hardware module that performs 
30 the SNR computation. Software may select certain parameters based on the constellation 
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type. FIG. 30 illustrates a block diagram showing the basic operation of a SNR-margin 
estimator. 

A hardware module may perform a SNR-margin estimate that operates on a 
continuous stream of input samples and produces a SNR value. The margin may be 
5 found by comparing SNR to a minimum value. The software may perform a function of 
specifying the number of samples to average over as well as the signal power for a 
particular constellation. 

The error signal may be computed by subtracting an input sample from a decoder 
or slicer output as shown in equation (70). 
10 e(n) = d{n)-x{n-K) (70) 

For example, slicer or decoder 3010 may receive an input signal x(n) to generate 
an output of d(n). In addition, input signal x(n) may be an input to function 3012 (e.g., z" 
K ) for generating an output to be subtracted from d(n) by adder 3014. The output of 
adder 3014 may include an error signal as defined in equation (70) above. 
15 K may be defined as the delay through the decoder or slicer and d(n) may be a 

soft-decision output of Trellis decoder or an output of a slicer 3010. This error signal 
may be equal to the noise in the signal if the assumption is made that the decision, d(n) 9 
is correct. This leads to a mean-squared-error (MSE) being approximately equal to the 
noise power as shown in equation (71). 

20 E[e\ni=MSE = ^{d{n)-x{n-K)) = P n (71) 

N n=0 



SNR = 10\og 



10 



p. \ ... 



p. 



= 101og 



\mse 



(72) 



= 101og 10 (P s )-101og 10 (MS£) 
Function 3016 may receive error signal e(n) and generate a square of the error 
signal e 2 (n). Function 3018 may receive squared error signal and generate MSE. In 
particular, function 3018 may accumulate N values and right shift by log2(N). 
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The signal power for a given constellation is known, so the first term on the right 
of equation (72) above is a constant and the MSE dB value may be computed. For the 
log estimation, the following may be used. The desire is to estimate log 2 (MSE), 

MSE = 2 k =:2 {K+kf) = 2 k '2 kf (73) 
5 where the exponent k is the sum of an integer part, k i , and a fractional part (less than 1), 

kf , as shown below. 

k = k t +k f (74) 

A rough approximation of log 2 (M5£') may be found using the following equation, 

log 2 (MS£) = k. + k f = k t + 4" - 1 (75) 

2 

10 Where k t is found by noting the position of the most significant bit (MSB). Function 
3020 may receive MSE and generate lOlogio(MSE). In particular, the estimate of the 
fraction, k f , may then be improved by estimating the logarithms nonlinear fractional 

component using equation (76). 

k f =0.0090+1.321% -0.3369*J (76) 
15 The result is the approximate log 2 (MSE) of an integer number. To find 101og 10 (M5£'), 
the scaling operation in equation (77) may be performed. 

101og 10 (MS£ )= 101og 10 (2)log 2 (M5£) (77) 

Finally, the SNR margin may be computed using equation (78). 

SNR^^SNR-SNR^ (78) 

20 Adder 3022 may be used to calculate SNR margin by subtracting the result of function 
3020 from 101ogio(P s ) - SNR m i n . The following precisions may be assumed, such as 
Polynomial coefficients are 10-bits {0x002, 0x152, 0x3aa}; power samples are 16-bits; 

the rough estimate of the log fraction, k f , is minimum 8-bits; the resulting log is 16 bits, 

6 integer and 10 fractional bits. 
25 An example may include the following log estimation where 

log 2 (0000000010101010fo) maybe calculated. 
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The bit location of the first 1 is 7 so this means ^ =7 = 111. To estimate k f , the 

MSB is removed leaving 0101010b, which is the rough estimate of the fractional part of 
the log. The rough estimate of log 2 is then 111.0101010, which is 7.328125. k f = 

(0.0090)+ (1.3211)(0.328125) + (0.3369)(0.328125) 2 . Therefore, jfcs 7.4062 To find 
5 101og 10 (00000000101010102>), 101og 10 (2)** = 22.2949. The actual result calculated 

with a calculator is 22.304. 

Several parameters may be implemented by software for the proper operation of 
the SNR-margin estimator. The number of samples to average over, N, may be input by 
software. This is input as log 2 (Af) and hardware will set the appropriate bit in a counter 

10 and then use the input value for the final shift. Another parameter may include the 
logarithm of the signal power. This value may be constellation specific and may be a 
parameter input by software even if the module works with one constellation. Yet 
another parameter may include the minimum SNR acceptable, SNR^ . 

FIG. 31 illustrates a flowchart describing a hardware algorithm for SNR margin, 

15 according to an embodiment of the present invention. At step 31 10, a start of a hardware 
algorithm for SNR margin may be initiated. Variables, such as n and sum may be 
initialized. At step 3112, an error signal, the square of the error signal and an 
accumulated result may be determined. At step 31 14, it may be determined whether n = 
N-l wherein N may be a summation length. If not, variable n may be adjusted by a 

20 predetermined value (as shown by n++) where step 3112 may be invoked. Otherwise, a 
logarithm may be determined and further scaled, at step 3116. At step 3118, the margin 
may be determined by subtracting a signal. Step 3120 indicates the end of the algorithm. 

An input to the hardware algorithm may include summation length, N. This 
number may be a power of 2 and may range from approximately 64 to approximately 

25 32768. Summation length may be input as log 2 (Af). Other inputs may include 
constellation type, 101og 10 (pJ and SNR mn , which represents a minimum SNR to obtain a 
specific BER, for example. Storage factors may include log 2 (iV max )+ 22 = 15 + 22 = 37 
bit accumulator to store the power sum. This accumulator may be cleared to zero every 
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time a new average is started. Other storage factors may include K+l, where K 
represents the delay from a Trellis decoder input to output, samples of the input signal, 
12-bits each, for example. The output may include SNR margin, which may include five 
integer bits. This means that a maximum value this output may have is approximately 31 
5 dB. This may be compared to a threshold of 0-15 dB to determine if the margin is too 
small. The 16-bit SNR value may be subtracted from the reference, which may also be 
16 bits. The value may then be rounded and the lower 5 integer bits compared to the 
threshold. 

FIG. 32 is a schematic diagram of a hardware architecture which may function 

10 with devices supporting certain inventive aspects of the present invention, AFE, such as 
the Aluminum™ 200 AFE, is a G.SHDSL/HDSL2/2B1Q SDS1 compliant Analog Front 
End (AFE) with integrated line driver designed to be used with Virata's Aluminum™ 
200 or 204 Symmetric DSL Processor to an external 2/4 wire hybrid. 

The Aluminum™ 200 AFE conforms to G.shdsl PSD masks for every rate when 

15 interfaced to the Aluminum™ 200 or 204 DSL Processor, Aluminum™ 200 AFE also 
conforms to the HDSL2 OPTIS PSD mask at 1.544 Mbps. Aluminum™ 200 AFE may 
be used in a central office or remote application mode, selectable by configuring the 
programmable filters in the Aluminum™ 200 or 204 DSL Processor. 

The Aluminum™ 200 AFE may include a high resolution 16-bit TX Digital 

20 Analog Converter (DAC) in the transmit path and one high resolution 16-bit RX Analog 
Digital Converter (ADC) in the receive path. A 10-bit DAC for the VCXO control is also 
integrated in the Aluminum™ 200 AFE to reduce the number of required external 
components. The transmitter programmable attenuation control (PAC) and the receiver 
programmable gain amplifier (PGA) may be programmed via the Aluminum™ 200 or 

25 204 processor through a two-wire serial bus. 

Aluminum™ 200 AFE has a low total power consumption of less than 800 mWatt 
(including the line drive) in full operation mode. An external line driver may also be 
used for HDSL2 and asymmetric PSD applications. Aluminum™ 200 AFE may also 
provide a power down mode for stand-by operation. 
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Product applications may include symmetric DSL routers and integrated access 
devices; DSL access multiplexers (DSLAMs); multi-tenant and multi-dwelling unit 
networks; Tl/El distribution products; and Tl/El pairgain systems (using proprietary 3 
Mbps and higher data rates). Specification details may include ITU G.991.2 (G.shdsl) 

5 compliant; T1E1.4 HDSL2 compliant; ETSI ETR-152 compliant (single pair); and 
support for data rates from 192 Kbps to 2.312 Mbps and 8Kbps increments, plus three 
additional rates of 3.096 Mbps, 4.104 Mbps and 4.616 Mbps. 

As shown in FIG. 32, a transmission line may include a Tx Digital Audio Video 
(DAV) 3210 coupled to a Tx Filter 3212, further coupled to a Programmable Attenuation 

10 Control (PAC) 3214. PAC 3214 is coupled to a Line Driver 3216, which is coupled to a 
Tx Tip & Ring. PAC 3214 may be further coupled to a HDSL2 bypass. A receiving line 
includes Automatic Gain Control (AGC) 3218 coupled to Rx Tip & Ring at an input and 
Rx Filter 3220 at an output. Rx Filter 3220 is coupled to Rx ADC 3222. A 
control/testing interface 3226 may be provided on a serial line. A clocking subsystem 

15 3224 may receive a plurality of inputs. The hardware architecture of FIG. 32 may further 
include a Current and Voltage Reference Generator (IV-REF) Subsystem 3228. 

While the foregoing description includes many details and specificities, it is to be 
understood that these have been included for purposes of explanation only, and are not to 
be interpreted as limitations of the present invention. Many modifications to the 

20 embodiments described above can be made without departing from the spirit and scope of 
the invention. 

The present invention is not to be limited in scope by the specific embodiments 
described herein. Indeed, various modifications of the present invention, in addition to 
those described herein, will be apparent to those of ordinary skill in the art from the 

25 foregoing description and accompanying drawings. Thus, such modifications are 
intended to fall within the scope of the following appended claims. Further, although the 
present invention has been described herein in the context of a particular implementation 
in a particular environment for a particular purpose, those of ordinary skill in the art will 
recognize that its usefulness is not limited thereto and that the present invention can be 

30 beneficially implemented in any number of environments for any number of purposes. 
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Accordingly, the claims set forth below should be construed in view of the full breath and 
spirit of the present invention as disclosed herein. 
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